Benchmark Dataset Collection
Datasets Summary
| Dataset | Summary | Type |
| Delve | Delve consists of:
1. A software environment, which allows you to manipulate datasets and
do statistical analysis of method performance. |
2C 6R |
| ELENA | Deals with classification.Comprehensive analysis on the dataset characteristics and various classification methods. Contains 3 artificial datasets with nonlinear properties. | 7C |
| PRNN | This collection contains datasets used in the book Pattern Recognition and Neural Networks by B.D. Ripley (1996) Cambridge University Press ISBN 0 521 46986 7 (hardback, 416 pages, 29.95 pounds, $49.95) | |
| PROBEN1 | Proben1 is a collection of benchmark problems for neural network research,
accompanied by a set of rules and conventions for their application. It
consists of - a set of data files in a very simple common format, - the set of original data files these files were created from, - documentation for each of these files, and - a technical report describing the set of problems and the rules |
11C 4R |
| StatLib (CMU) | Datasets from CMU Statistics Department | |
| STATLOG | Project StatLog (Esprit Project 5170) was concerned with comparative studies of different machine learning, neural and statistical classification algorithms. About 20 different algorithms were evaluated on more than 20 (only 10 are available publicly) different datasets. The tests carried out under project produced many interesting results. The results(not available online) of these tests are comprehensively described in a book (D.Michie et.al, 1994). | 10C |
| UC Irvine Repository | A huge database of machine learning datasets. | >74C >3R |
| C: | Classification |
| R: | Regression |