EU data mining hacks available for U.S.
- By Joab Jackson
- Sep 03, 2004
A U.K. university spinoff company is offering a set of algorithms developed by the European Union that might help U.S. agencies data mine more accurately.
Numerical Algorithms Group of London (http://www.nag.com) has released Version 2.0 of its Data Mining and Cleaning Components software package. This version makes use of results from a $4.6 million, three-year EU research project called Euredit
The Euredit work addressed shortcomings in current data mining techniques for handling missing or erroneous data, according to Stephen Langdell, a member of the Numerical Library Division of NAG. It also pioneered some techniques for better memory management.
The Euredit algorithms are already crunching socioeconomic data for statistical offices in Italy, the U.K. and Finland, as well as for a number of European universities, Langdell said.
The algorithms are written in ANSI C and have simple function interfaces that can be used to interface with existing or new applications. Users can write scripts or small programs for applying the algorithms to datasets, using programming or scripting languages such as Java, C#, Perl or Python.
The basic desktop developer edition of Data Mining and Cleaning Components costs $1,995. NAG offers versions for both Microsoft Windows and Linux.
Created in 1970, NAG is a collaborative venture among a number of British universities and government libraries to develop and sell libraries of numerical and statistical subroutines.
Joab Jackson is the senior technology editor for Government Computer News.