Skip to Content

UFO paranormal Directory, Community and News

Artificial Intelligence: Machine Learning: Datasets




Datasets

Links

The RCSB Protein Data Bank (PDB)

http://www.rcsb.org/pdb/

Archive of experimentally-determined, biological macromolecule 3-D structures from the Brookhaven National Laboratory.

Review It Rate It Bookmark It

Dataset generator

http://www.datgen.com/

Datgen, formerly SCDS, is a computer program that generates data to systematically test programs that consume data. These synthetic datasets can be used to validate learning algorithms.

Review It Rate It Bookmark It

DELVE - Data for Evaluating Learning in Valid Experiments

http://www.cs.utoronto.ca/~delve/

Data for Evaluating Learning Valid Experiments: A standardized environment designed to evaluate the performance of methods that learn relationships based primarily on empirical data. Delve makes it possible for users to compare their learning methods with other methods on many datasets.

Review It Rate It Bookmark It

UCI Machine Learning Repository

http://www.ics.uci.edu/~mlearn/MLRepository.html

A repository of databases, domain theories and data generators that are used by the machine learning community for the empirical analysis of machine learning algorithms.

Review It Rate It Bookmark It

TREC Data

http://trec.nist.gov/data.html

Text datasets used in information retrieval and learning in text domains.

Review It Rate It Bookmark It

National Space Science Data Center

http://nssdc.gsfc.nasa.gov/

Provides access to a wide variety of astrophysics, space physics, solar physics, lunar and planetary data from NASA space flight missions, in addition to selected other data and some models and software.

Review It Rate It Bookmark It

The StatLib Datasets Archive

http://lib.stat.cmu.edu/datasets/

A repository of datasets used in statistics and machine learning.

Review It Rate It Bookmark It

Time Series Data Library

http://www-personal.buseco.monash.edu.au/~hyndman/TSDL/

A collection of over 500 time series, maintained by Rob Hyndman. Time series are organized by subject.

Review It Rate It Bookmark It

Penn Treebank Project

http://www.cis.upenn.edu/~treebank/

A corpus of parsed sentences. Used by many researchers for training data-driven parsing algorithms.

Review It Rate It Bookmark It

HS3D - Homo Sapiens Splice Sites Dataset

http://www.sci.unisannio.it/docenti/rampone/

HS3D (Homo Sapiens Splice Sites Dataset) is a database of Homo Sapiens Exon, Intron and Splice regions extracted from GenBank primate sequences Rel.123. The aim of this data set is to give standardized material to train and to assess the prediction accuracy of computational approaches for gene identification and characterization.

Review It Rate It Bookmark It

Learning Relational Concepts from Sensor Data of a Mobile Robot

http://www-ai.cs.uni-dortmund.de/FORSCHUNG/PROJEKTE/BLEARN2/data-sets.html

A set of data sets, where each data set is represented in first order logic. Maintained at the University of Dortmund, Germany.

Review It Rate It Bookmark It

Web->KB dataset

http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-11/www/wwkb/

Web pages partitioned into classes, with hyperlink data. The dataset has been used for text categorization and learning to extract symbolic knowledge from the World Wide Web.

Review It Rate It Bookmark It

RISE: Repository of Information Sources used in information Extraction tasks.

http://www.isi.edu/info-agents/RISE/

Repository of online information sources: test domains for information extraction and wrapper generation tools that learn extraction rules (extraction patterns).

Review It Rate It Bookmark It

UFO Home |  Webmasters |  Add a Site |  Modify a Site |  New |  Cool |  Top Rated |  Bookmarks
Category for: Artificial Intelligence: Machine Learning: Datasets