EU data mining hacks available for U.S.

A U.K. university spinoff company is offering a set of algorithms developed by the European Union that might help U.S. agencies data mine more accurately.

Numerical Algorithms Group of London ( has released Version 2.0 of its Data Mining and Cleaning Components software package. This version makes use of results from a $4.6 million, three-year EU research project called Euredit.

The Euredit work addressed shortcomings in current data mining techniques for handling missing or erroneous data, according to Stephen Langdell, a member of the Numerical Library Division of NAG. It also pioneered some techniques for better memory management.

The Euredit algorithms are already crunching socioeconomic data for statistical offices in Italy, the U.K. and Finland, as well as for a number of European universities, Langdell said.

The algorithms are written in ANSI C and have simple function interfaces that can be used to interface with existing or new applications. Users can write scripts or small programs for applying the algorithms to datasets, using programming or scripting languages such as Java, C#, Perl or Python.

The basic desktop developer edition of Data Mining and Cleaning Components costs $1,995. NAG offers versions for both Microsoft Windows and Linux.

Created in 1970, NAG is a collaborative venture among a number of British universities and government libraries to develop and sell libraries of numerical and statistical subroutines.

About the Author

Joab Jackson is the senior technology editor for Government Computer News.


  • 2020 Government Innovation Awards
    Government Innovation Awards -

    21 Public Sector Innovation award winners

    These projects at the federal, state and local levels show just how transformative government IT can be.

  • Federal 100 Awards
    cheering federal workers

    Nominations for the 2021 Fed 100 are now being accepted

    The deadline for submissions is Dec. 31.

Stay Connected