Software
Links
Meta-MEME v2.0.1
Software toolkit for building and using motif-based hidden Markov models of DNA and proteins, from the University of California, San Diego.
SUBDUE Knowledge Discovery in Structural Databases
The program discovers interesting and repetitive subgraphs in a labeled graph representation using the minimum description length principle. Applications to molecular biology. [Free]
HMM and other statistical programs
http://www.cfar.umd.edu/~kanungo/software/software.html
This page offers an implementation of Hidden Markov Models and an application to part-of-speech tagging. Also available are multivariate hypothesis testing software for Gaussian data and TRUEVIZ, a groundtruth/metadata editing and visualizing toolkit for OCR.
Pfam
A large collection of multiple sequence alignments and trained hidden Markov models covering many common protein domains.
Multiple EM for Motif Elicitation and Motif Alignment and Search Tool (MEME/MAST)
http://meme.sdsc.edu/meme/website/
The MEME System is a program for discovering motifs in groups of related DNA or protein sequences. MAST is a tool for searching biological sequence databases for sequences that contain one or more of a group of known motifs.
Sequence Alignment and Modeling System (SAM)
http://www.cse.ucsc.edu/research/compbio/sam.html
A collection of tools for creating and using HMMs for biological sequences. Free license for academic and nonprofit use.
MIX
http://icarus.math.mcmaster.ca/peter/mix/mix.html
Software for learning Mixture Distributions. Commercial license.
Machine Learning Programs by Peter Clark
http://www.cs.utexas.edu/users/pclark/software.html
QM: Guiding inductive learning with a Qualitative Model. LPE: Lazy Partial Evaluation. CN2: Rule induction from examples. [Free]
Statistical Decision Trees
http://www.isip.msstate.edu/projects/speech/software/legacy/decision_tree/index.html
A program for inducing Bayesian decision trees. Applications to speech. [Free]
Software Packages for Graphical Models/Bayesian Networks
http://www.ai.mit.edu/~murphyk/Software/bnsoft.html
Directory of software tools for modeling graphs and Bayesian networks. Some have learning capabilities.
Observable Operator Modeling Kit
Machine learning library for Observable Operator Models (OOMs), suitable for classification and prediction of time-series and sequence data. OOMs are similar to, but more powerful than, HMMs. [C++, BSD license]
GNU Hidden Markov Model Library
http://sourceforge.net/projects/ghmm
Hidden Markov Models software library from the Center of Applied Informatics, Cologne. Includes algorithms such as Viterbi, Baum-Welch, and Forward-Backward. [C, GPL license]
Bayes++
http://bayesclasses.sourceforge.net
A library of C++ classes for Bayesian filtering. From the Australian Centre for Field Robotics. [C++, MIT license]
libbpfl - Bayesian Probability Filtering Library
http://libbpfl.sourceforge.net
A general purpose library for Bayesian filtering. [C++, LGPL license]
XELOPES Data Mining Library
http://www.prudsys.com/Produkte/Algorithmen/Xelopes
Platform- and data-source-independent library for embedded data mining based on the CWM/OMG and other data mining standards. XELOPES-Java algorithms: SVMs, market basket analysis, sequence analysis, decision trees, cluster analysis, multidimensional grouping. XELOPES-C++ algorithms: SVMs, decision trees. [GPL]
Experience-Based Language Acquisition
http://sourceforge.net/projects/ebla
Computational model of human language acquisition written in Java; currently acquires a protolanguage of nouns and verbs based on visual perception.



