Takes a database of cases described by a combination of real and discrete valued attributes, and automatically finds the natural classes in that data. It can be seen as a Naive Bayes classifier where the class node is hidden. [Free] http://ic-www.arc.nasa.gov/ic/projects/bayes-group/autoclass/
Platform- and data-source-independent library for embedded data mining based on the CWM/OMG and other data mining standards. XELOPES-Java algorithms: SVMs, market basket analysis, sequence analysis, decision trees, cluster analysis, multidimensional group http://www.prudsys.com/Produkte/Algorithmen/Xelopes
The program discovers interesting and repetitive subgraphs in a labeled graph representation using the minimum description length principle. Applications to molecular biology. [Free] http://cygnus.uta.edu/subdue/
Supports several inference algorithms and learning algorithms. Allows simulation of static and dynamic networks, including HMMs, IOHMMs, and Kalman filters. http://www.cs.berkeley.edu/~murphyk/Bayes/bnt.html
Suite of Perl tools for counting and analyzing word n-grams in text; provides standard tests of association for identifying word n-grams in large corpora and allows users to implement other tests with minimal Perl knowledge. http://www.d.umn.edu/~tpederse/nsp.html
On this page an imlementation of Hidden Markov Models and an application to part-of-speech tagging. Also available a multivariate hypothesis testing software for Gaussian Data and TRUEVIZ: A groundtruth/metadata Editing and Visualizing Toolkit for OCR. http://www.cfar.umd.edu/~kanungo/software/software.html
A general approach to the problem of inducing natural language parsers. It uses an annotated corpus, and produces a parser by using ILP for inducing the rules that control the actions of a shift-reduce parser. [Free] http://www.cs.utexas.edu/users/ml/chill.html