Machine learning datasets

Since January 2018


Datasets 94

Hepatitis

hepatitis | files 2 | 96kB
updated 4 months ago

This dataset contains occurrences of hepatitis in people. Data: This dataset was found on OpenML (hepatitis). Donor: G. Gong (Carnegie-Mellon University) via Bojan Cestnik, Jozef Stefan Institute, Jamova 39, 61000 Ljubljana, Yugoslavia. Attribute information: bilirubin: 0.3 - 4.8; alk_phosphate: 33 -

Creditcard

creditcard | files 2 | 855MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1597 Author: Andrea Dal Pozzolo, Olivier Caelen and Gianluca Bontempi Source: Credit card fraud detection - Date 25th of June 2015 Please cite: Andrea Dal Pozzolo, Olivier Caelen, Reid A. Johnson and Gianluca Bontempi.

Bioresponse

bioresponse | files 2 | 235MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/4134 Author: Boehringer Ingelheim Source: Kaggle - 2011 Please cite: None Predict a biological response of molecules from their chemical properties. Each row in this data set represents a molecule. The first column contains

Kddcup99

kddcup99 | files 2 | 686MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1113 Author: Source: Unknown - Date unknown Please cite: This is a 10% stratified subsample of the data from the 1999 ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php). Modified by TunedIT (converted to ARFF

Arrhythmia

arrhythmia | files 3 | 5MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/5 Author: H. Altay Guvenir, Burak Acar, Haldun Muderrisoglu Source: UCI Please cite: UCI Cardiac Arrhythmia Database The aim is to determine the type of arrhythmia from the ECG recordings. This database contains 279

Vehicle

vehicle | files 3 | 715kB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/54 Author: Dr. Pete Mowforth and Dr. Barry Shepherd Source: UCI Please cite: Siebert, JP. Turing Institute Research Memorandum TIRM-87-018 "Vehicle Recognition Using Rule Based Methods" (March 1987) NAME vehicle

Seismic bumps

seismic-bumps | files 3 | 70kB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1500 Author: Sikora M., Wrobel L. Source: UCI Please cite: Sikora M., Wrobel L.: Application of rule induction algorithms for analysis of data collected by seismic hazard monitoring systems in coal mines. Archives of Mining

Steel plates fault

steel-plates-fault | files 3 | 2MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1504 Author: Semeion, Research Center of Sciences of Communication, Rome, Italy. Source: UCI Please cite: Dataset provided by Semeion, Research Center of Sciences of Communication, Via Sersale 117, 00128, Rome, Italy. Steel

Mozilla4

mozilla4 | files 3 | 4MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1046 Author: Source: Unknown - Date unknown Please cite: This is a PROMISE Software Engineering Repository data set made publicly available in order to

Pendigits

pendigits | files 3 | 7MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/32 Author: E. Alpaydin, Fevzi. Alimoglu Source: UCI Machine Learning Repository Please cite: UCI citation policy Pen-Based Recognition of Handwritten Digits We create a digit database by collecting 250 samples from 44

Climate model simulation crashes

climate-model-simulation-crashes | files 3 | 597kB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1467 Author: D. Lucas, R. Klein, J. Tannahill, D. Ivanova, S. Brandon, D. Domyancic, Y. Zhang. Source: UCI Please cite: Lucas, D. D., Klein, R., Tannahill, J., Ivanova, D., Brandon, S., Domyancic, D., and Zhang, Y.: Failure

Breast w

breast-w | files 3 | 295kB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/15 Author: Dr. William H. Wolberg, University of Wisconsin Source: UCI, University of Wisconsin - 1995 Please cite: See below, plus UCI Breast Cancer Wisconsin (Original) Data Set. Features are computed from a digitized

Kc1

kc1 | files 3 | 2MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1067 Author: Mike Chapman, NASA Source: tera-PROMISE - 2004 Please cite: Sayyad Shirabad, J. and Menzies, T.J. (2005) The PROMISE Repository of Software Engineering Databases. School of Information Technology and Engineering,

Kc2

kc2 | files 3 | 415kB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1063 Author: Mike Chapman, NASA Source: tera-PROMISE - 2004 Please cite: Sayyad Shirabad, J. and Menzies, T.J. (2005) The PROMISE Repository of Software Engineering Databases. School of Information Technology and Engineering,

Cmc

cmc | files 3 | 536kB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/23 Author: Tjen-Sien Lim Source: As obtained from UCI Please cite: UCI citation Title: Contraceptive Method Choice Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia

Banknote authentication

banknote-authentication | files 3 | 321kB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1462 Author: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe) Source: UCI - 2012 Please cite: UCI Dataset about distinguishing genuine and forged banknotes. Data were extracted from images that were taken

Cervical cancer

cervical-cancer | files 2 | 946kB
updated 9 months ago

This is a dataset about cervical cancer occurrences. Cervical cancer is one of the most frequent cancers in women. This dataset shows some factors that might influence cervical cancer. Data: This dataset was found on UCI under the name Cervical cancer (Risk Factors) Data

Primary tumor

primary-tumor | files 2 | 180kB
updated 9 months ago

This is a dataset about primary tumors in people. The location of a primary tumor is the place in the body where the tumor first appeared and from which it started to metastasize to other parts of the body. Data: This dataset was found on OpenML (primary-tumor). This primary tumor domain was obtained from

Fertility

fertility | files 2 | 28kB
updated 9 months ago

This is a dataset containing fertility instances. Data: This dataset was found under the name Fertility Data Set. 100 instances, 10 attributes, missing values: no. The data is located in a directory called data (data/fertility.csv). Attributes are the same as they were in the input data. Preparation: To get

Magictelescope

magictelescope | files 2 | 11MB
updated 1 month ago

The resources for this dataset can be found at https://www.openml.org/d/1120 Author: R. K. Bock. Major Atmospheric Gamma Imaging Cherenkov Telescope project (MAGIC) Donated by P. Savicky, Institute of Computer Science, AS of CR, Czech Republic Source: UCI Please cite: Bock, R.K., Chilingarian, A.,
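
Most entries in this listing point to an OpenML page of the form https://www.openml.org/d/<id>. As a minimal sketch, assuming scikit-learn is installed and the machine is online, the numeric ID can be pulled out of such a URL and passed to scikit-learn's `fetch_openml`. The helper names `openml_id_from_url` and `load_from_listing_url` are hypothetical and are not part of this listing or of OpenML.

```python
import re


def openml_id_from_url(url: str) -> int:
    """Extract the numeric dataset ID from an OpenML URL such as
    https://www.openml.org/d/1597 (hypothetical helper)."""
    match = re.search(r"openml\.org/d/(\d+)", url)
    if match is None:
        raise ValueError(f"not an OpenML dataset URL: {url!r}")
    return int(match.group(1))


def load_from_listing_url(url: str):
    """Download the dataset behind an OpenML URL (hypothetical helper).

    Assumes scikit-learn is installed and network access is available.
    The import is local so the ID helper above works without scikit-learn.
    """
    from sklearn.datasets import fetch_openml

    # as_frame=True asks for pandas objects, which also requires pandas.
    return fetch_openml(data_id=openml_id_from_url(url), as_frame=True)


if __name__ == "__main__":
    # URL taken from the "Banknote authentication" entry in this listing.
    print(openml_id_from_url("https://www.openml.org/d/1462"))
```

Only the ID extraction runs without a network connection; `load_from_listing_url` performs a download, so large entries such as creditcard (855MB in this listing) may take a while to fetch.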