Machine learning datasets

Since January 2018


Datasets 94

Autouniv au6 1000

autouniv-au6-1000 | files 2 | 1MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/1555. Author: Ray J. Hickey. Source: UCI. Please cite: Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-1000. Abstract: AutoUniv is an advanced data generator for classification tasks. The aim is to reflect the ...

Jm1

jm1 | files 3 | 9MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/1053. Author: Mike Chapman, Galaxy Global Corporation. Source: PROMISE Repository. Please cite: please follow the acknowledgment guidelines posted on the PROMISE repository web page. This is a PROMISE data set made publicly ...

Abalone

abalone | files 3 | 2MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/183. Author: Source: Unknown - Please cite: Title of Database: Abalone data. Sources: (a) Original owners of database: Marine Resources Division, Marine Research Laboratories - Taroona, Department of Primary Industry ...

Semeion

semeion | files 3 | 8MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/1501. Author: Semeion Research Center of Sciences of Communication. Source: UCI. Please cite: Semeion Research Center of Sciences of Communication, via Sersale 117, 00128 Rome, Italy; Tattile, Via Gaetano Donizetti, 1-3-5, 25030 ...

Soybean

soybean | files 3 | 1MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/42. Author: R.S. Michalski and R.L. Chilausky (Donors: Ming Tan & Jeff Schlimmer). Source: UCI - 1988. Please cite: R.S. Michalski and R.L. Chilausky "Learning by Being Told and Learning from Examples: An Experimental Comparison ...

Optdigits

optdigits | files 3 | 12MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/28. Author: E. Alpaydin, C. Kaynak. Source: UCI. Please cite: UCI citation policy. Title of Database: Optical Recognition of Handwritten Digits. Source: E. Alpaydin, C. Kaynak, Department of Computer Engineering, Bogazici ...

Profb

profb | files 3 | 238kB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/470. Author: Hal Stern, Robin Lock. Source: StatLib. Please cite: PRO FOOTBALL SCORES (raw data appears after the description below). How well do the oddsmakers of Las Vegas predict the outcome of professional football games? ...

Lung cancer

lung-cancer | files 3 | 62kB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/163. Author: Source: Unknown - Please cite: Title: Lung Cancer Data. Source Information: Data was published in: Hong, Z.Q. and Yang, J.Y. "Optimal Discriminant Plane for a Small Number of Samples and Design Method ...

Cardiotocography

cardiotocography | files 3 | 3MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/1466. Author: J. P. Marques de Sá, J. Bernardes, D. Ayres de Campos. Source: UCI. Please cite: Ayres de Campos et al. (2000) SisPorto 2.0 A Program for Automated Analysis of Cardiotocograms. J Matern Fetal Med 5:311-318, ...

Credit g

credit-g | files 3 | 1MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/31. Author: Dr. Hans Hofmann. Source: UCI - 1994. Please cite: UCI. German Credit data. This dataset classifies people described by a set of attributes as good or bad credit risks. This dataset comes with a cost matrix: ...

Mfeat morphological

mfeat-morphological | files 3 | 801kB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/18. Author: Robert P.W. Duin, Department of Applied Physics, Delft University of Technology. Source: UCI - 1998. Please cite: UCI. Multiple Features Dataset: Morphological. One of a set of 6 datasets describing features of ...

Spambase

spambase | files 3 | 12MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/44. Author: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt. Source: UCI. Please cite: UCI. SPAM E-mail Database. The "spam" concept is diverse: advertisements for products/websites, make money fast schemes, chain ...

Yeast

yeast | files 3 | 456kB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/181. Author: Source: Unkn...

Pc3

pc3 | files 3 | 2MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/1050. Author: Mike Chapman, NASA. Source: tera-PROMISE - 2004. Please cite: Sayyad Shirabad, J. and Menzies, T.J. (2005) The PROMISE Repository of Software Engineering Databases. School of Information Technology and Engineering, ...

Scene

scene | files 3 | 60MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/312. Author: Matthew R. Boutell, Jiebo Luo, Xipeng Shen, and Christopher M. Brown. Source: Mulan. Please cite: Description: Scene recognition dataset. It contains characteristics about images and their classes. The original ...

Kropt

kropt | files 3 | 7MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/184. Author: Source: Unkn...

Credit approval

credit-approval | files 3 | 266kB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/29. Author: Confidential - Donated by Ross Quinlan. Source: UCI - 1987. Please cite: UCI. Credit Approval. This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to ...

Speed dating

speed-dating | files 2 | 46MB
updated 1 year ago

This dataset is about speed dating. The data was gathered from participants in experimental speed dating events from 2002 to 2004. During the events, the attendees would have a four-minute "first date" with every other participant of the opposite sex. At the end of their four minutes, participants ...

Satellite

satellite | files 2 | 6MB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/40900. Author: Markus Goldstein. Source: Dataverse. Please cite: The satellite dataset comprises features extracted from satellite observations. In particular, each image was taken under four different light wavelengths, two ...

Covertype

covertype | files 2 | 1GB
updated 1 year ago

The resources for this dataset can be found at https://www.openml.org/d/150. Author: Albert Bifet. Source: MOA - 2009. Please cite: Normalized version of the Forest Covertype dataset (see version 1), so that the numerical values are between 0 and 1. Contains the forest cover type for 30 x 30 meter ...
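
The Covertype entry describes a version normalized so that numerical values lie between 0 and 1. The listing does not say which method was used; the sketch below assumes plain min-max scaling, one common way to get that range. The function name `min_max_scale` and the sample column are hypothetical, made up for illustration, and are not taken from the dataset.

```python
from typing import List, Sequence


def min_max_scale(values: Sequence[float]) -> List[float]:
    """Rescale values linearly so the minimum maps to 0.0 and the maximum to 1.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column has no spread; map every entry to 0.0 by convention.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical numeric column (illustrative values only, not real Covertype data).
elevation = [1800.0, 2400.0, 3000.0, 3600.0]
scaled = min_max_scale(elevation)
print(scaled)
```

Each numeric column would be scaled independently in this scheme; categorical or binary columns would normally be left untouched.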