openml-datasets

8,287
0
Updated:
Files:2
Size:17.1 MB
Formats:csvarff
License:ODC-PDDL

API Access

Access dataset files directly from scripts, code, or AI agents.

Browse dataset files
Dataset Files

Each file has a stable URL (r-link) that you can use directly in scripts, apps, or AI agents. These URLs are permanent and safe to hardcode.

/core/openml-datasets/
https://datahub.io/core/openml-datasets/_r/-/FRESHNESS_CHECK.md
https://datahub.io/core/openml-datasets/_r/-/README.md
https://datahub.io/core/openml-datasets/_r/-/UPDATE_SCRIPT_MAINTENANCE_REPORT.md
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/Bioresponse.arff
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/Bioresponse.csv
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/Click_prediction_small.arff
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/Click_prediction_small.csv
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/IMDB.drama.arff
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/IMDB.drama.csv
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/README.md
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/MagicTelescope.arff
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/MagicTelescope.csv
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/README.md
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/Satellite.arff
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/Satellite.csv
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/README.md
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/SpeedDating.arff
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/SpeedDating.csv
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/abalone/README.md
https://datahub.io/core/openml-datasets/_r/-/data/abalone/abalone.arff
https://datahub.io/core/openml-datasets/_r/-/data/abalone/abalone.csv
https://datahub.io/core/openml-datasets/_r/-/data/abalone/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/adult/README.md
https://datahub.io/core/openml-datasets/_r/-/data/adult/adult.arff
https://datahub.io/core/openml-datasets/_r/-/data/adult/adult.csv
https://datahub.io/core/openml-datasets/_r/-/data/adult/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/airlines/README.md
https://datahub.io/core/openml-datasets/_r/-/data/airlines/airlines.arff
https://datahub.io/core/openml-datasets/_r/-/data/airlines/airlines.csv
https://datahub.io/core/openml-datasets/_r/-/data/airlines/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/README.md
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/amazon-commerce-reviews.arff
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/amazon-commerce-reviews.csv
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/anneal/README.md
https://datahub.io/core/openml-datasets/_r/-/data/anneal/anneal.arff
https://datahub.io/core/openml-datasets/_r/-/data/anneal/anneal.csv
https://datahub.io/core/openml-datasets/_r/-/data/anneal/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/README.md
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/arrhythmia.arff
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/arrhythmia.csv
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/README.md
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/autoUniv-au6-1000.arff
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/autoUniv-au6-1000.csv
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/autos/README.md
https://datahub.io/core/openml-datasets/_r/-/data/autos/autos.arff
https://datahub.io/core/openml-datasets/_r/-/data/autos/autos.csv
https://datahub.io/core/openml-datasets/_r/-/data/autos/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/README.md
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/bank-marketing.arff
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/bank-marketing.csv
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/README.md
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/banknote-authentication.arff
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/banknote-authentication.csv
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/README.md
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/blood-transfusion-service-center.arff
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/blood-transfusion-service-center.csv
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/breast-cancer.arff
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/breast-cancer.csv
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/README.md
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/breast-w.arff
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/breast-w.csv
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/README.md
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/cardiotocography.arff
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/cardiotocography.csv
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/README.md
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/climate-model-simulation-crashes.arff
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/climate-model-simulation-crashes.csv
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/cmc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/cmc/cmc.arff
https://datahub.io/core/openml-datasets/_r/-/data/cmc/cmc.csv
https://datahub.io/core/openml-datasets/_r/-/data/cmc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/README.md
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/credit-approval.arff
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/credit-approval.csv
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/README.md
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/credit-g.arff
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/credit-g.csv
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/README.md
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/diabetes.arff
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/diabetes.csv
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/README.md
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/eeg-eye-state.arff
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/eeg-eye-state.csv
https://datahub.io/core/openml-datasets/_r/-/data/electricity/README.md
https://datahub.io/core/openml-datasets/_r/-/data/electricity/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/electricity/electricity.arff
https://datahub.io/core/openml-datasets/_r/-/data/electricity/electricity.csv
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/fbis.wc.arff
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/fbis.wc.csv
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/README.md
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/first-order-theorem-proving.arff
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/first-order-theorem-proving.csv
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/gas-drift.arff
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/gas-drift.csv
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/gina_agnostic.arff
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/gina_agnostic.csv
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/gina_prior2.arff
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/gina_prior2.csv
https://datahub.io/core/openml-datasets/_r/-/data/glass/README.md
https://datahub.io/core/openml-datasets/_r/-/data/glass/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/glass/glass.arff
https://datahub.io/core/openml-datasets/_r/-/data/glass/glass.csv
https://datahub.io/core/openml-datasets/_r/-/data/haberman/README.md
https://datahub.io/core/openml-datasets/_r/-/data/haberman/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/haberman/haberman.arff
https://datahub.io/core/openml-datasets/_r/-/data/haberman/haberman.csv
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/README.md
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/heart-statlog.arff
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/heart-statlog.csv
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/README.md
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/hill-valley.arff
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/hill-valley.csv
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/README.md
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/ilpd.arff
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/ilpd.csv
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/README.md
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/ionosphere.arff
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/ionosphere.csv
https://datahub.io/core/openml-datasets/_r/-/data/iris/README.md
https://datahub.io/core/openml-datasets/_r/-/data/iris/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/iris/iris.arff
https://datahub.io/core/openml-datasets/_r/-/data/iris/iris.csv
https://datahub.io/core/openml-datasets/_r/-/data/isolet/README.md
https://datahub.io/core/openml-datasets/_r/-/data/isolet/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/isolet/isolet.arff
https://datahub.io/core/openml-datasets/_r/-/data/isolet/isolet.csv
https://datahub.io/core/openml-datasets/_r/-/data/jm1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/jm1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/jm1/jm1.arff
https://datahub.io/core/openml-datasets/_r/-/data/jm1/jm1.csv
https://datahub.io/core/openml-datasets/_r/-/data/kc1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kc1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kc1/kc1.arff
https://datahub.io/core/openml-datasets/_r/-/data/kc1/kc1.csv
https://datahub.io/core/openml-datasets/_r/-/data/kc2/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kc2/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kc2/kc2.arff
https://datahub.io/core/openml-datasets/_r/-/data/kc2/kc2.csv
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/kr-vs-kp.arff
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/kr-vs-kp.csv
https://datahub.io/core/openml-datasets/_r/-/data/kropt/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kropt/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kropt/kropt.arff
https://datahub.io/core/openml-datasets/_r/-/data/kropt/kropt.csv
https://datahub.io/core/openml-datasets/_r/-/data/letter/README.md
https://datahub.io/core/openml-datasets/_r/-/data/letter/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/letter/letter.arff
https://datahub.io/core/openml-datasets/_r/-/data/letter/letter.csv
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/README.md
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/liver-disorders.arff
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/liver-disorders.csv
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/lung-cancer.arff
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/lung-cancer.csv
https://datahub.io/core/openml-datasets/_r/-/data/lymph/README.md
https://datahub.io/core/openml-datasets/_r/-/data/lymph/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/lymph/lymph.arff
https://datahub.io/core/openml-datasets/_r/-/data/lymph/lymph.csv
https://datahub.io/core/openml-datasets/_r/-/data/madelon/README.md
https://datahub.io/core/openml-datasets/_r/-/data/madelon/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/madelon/madelon.arff
https://datahub.io/core/openml-datasets/_r/-/data/madelon/madelon.csv
https://datahub.io/core/openml-datasets/_r/-/data/mammography/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mammography/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mammography/mammography.arff
https://datahub.io/core/openml-datasets/_r/-/data/mammography/mammography.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/mfeat-factors.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/mfeat-factors.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/mfeat-karhunen.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/mfeat-karhunen.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/mfeat-morphological.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/mfeat-morphological.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/mfeat-pixel.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/mfeat-pixel.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/mfeat-zernike.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/mfeat-zernike.csv
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/mozilla4.arff
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/mozilla4.csv
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/mushroom.arff
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/mushroom.csv
https://datahub.io/core/openml-datasets/_r/-/data/musk/README.md
https://datahub.io/core/openml-datasets/_r/-/data/musk/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/musk/musk.arff
https://datahub.io/core/openml-datasets/_r/-/data/musk/musk.csv
https://datahub.io/core/openml-datasets/_r/-/data/nursery/README.md
https://datahub.io/core/openml-datasets/_r/-/data/nursery/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/nursery/nursery.arff
https://datahub.io/core/openml-datasets/_r/-/data/nursery/nursery.csv
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/README.md
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/oil_spill.arff
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/oil_spill.csv
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/README.md
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/one-hundred-plants-shape.arff
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/one-hundred-plants-shape.csv
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/README.md
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/one-hundred-plants-texture.arff
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/one-hundred-plants-texture.csv
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/README.md
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/optdigits.arff
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/optdigits.csv
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/README.md
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/page-blocks.arff
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/page-blocks.csv
https://datahub.io/core/openml-datasets/_r/-/data/pc1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pc1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pc1/pc1.arff
https://datahub.io/core/openml-datasets/_r/-/data/pc1/pc1.csv
https://datahub.io/core/openml-datasets/_r/-/data/pc3/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pc3/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pc3/pc3.arff
https://datahub.io/core/openml-datasets/_r/-/data/pc3/pc3.csv
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/pendigits.arff
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/pendigits.csv
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/README.md
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/phoneme.arff
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/phoneme.csv
https://datahub.io/core/openml-datasets/_r/-/data/profb/README.md
https://datahub.io/core/openml-datasets/_r/-/data/profb/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/profb/profb.arff
https://datahub.io/core/openml-datasets/_r/-/data/profb/profb.csv
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/README.md
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/qsar-biodeg.arff
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/qsar-biodeg.csv
https://datahub.io/core/openml-datasets/_r/-/data/satimage/README.md
https://datahub.io/core/openml-datasets/_r/-/data/satimage/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/satimage/satimage.arff
https://datahub.io/core/openml-datasets/_r/-/data/satimage/satimage.csv
https://datahub.io/core/openml-datasets/_r/-/data/scene/README.md
https://datahub.io/core/openml-datasets/_r/-/data/scene/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/scene/scene.arff
https://datahub.io/core/openml-datasets/_r/-/data/scene/scene.csv
https://datahub.io/core/openml-datasets/_r/-/data/segment/README.md
https://datahub.io/core/openml-datasets/_r/-/data/segment/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/segment/segment.arff
https://datahub.io/core/openml-datasets/_r/-/data/segment/segment.csv
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/README.md
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/seismic-bumps.arff
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/seismic-bumps.csv
https://datahub.io/core/openml-datasets/_r/-/data/semeion/README.md
https://datahub.io/core/openml-datasets/_r/-/data/semeion/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/semeion/semeion.arff
https://datahub.io/core/openml-datasets/_r/-/data/semeion/semeion.csv
https://datahub.io/core/openml-datasets/_r/-/data/sick/README.md
https://datahub.io/core/openml-datasets/_r/-/data/sick/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/sick/sick.arff
https://datahub.io/core/openml-datasets/_r/-/data/sick/sick.csv
https://datahub.io/core/openml-datasets/_r/-/data/sonar/README.md
https://datahub.io/core/openml-datasets/_r/-/data/sonar/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/sonar/sonar.arff
https://datahub.io/core/openml-datasets/_r/-/data/sonar/sonar.csv
https://datahub.io/core/openml-datasets/_r/-/data/soybean/README.md
https://datahub.io/core/openml-datasets/_r/-/data/soybean/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/soybean/soybean.arff
https://datahub.io/core/openml-datasets/_r/-/data/soybean/soybean.csv
https://datahub.io/core/openml-datasets/_r/-/data/spambase/README.md
https://datahub.io/core/openml-datasets/_r/-/data/spambase/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/spambase/spambase.arff
https://datahub.io/core/openml-datasets/_r/-/data/spambase/spambase.csv
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/spectrometer.arff
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/spectrometer.csv
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/README.md
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/steel-plates-fault.arff
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/steel-plates-fault.csv
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/README.md
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/tic-tac-toe.arff
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/tic-tac-toe.csv
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/README.md
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/vehicle.arff
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/vehicle.csv
https://datahub.io/core/openml-datasets/_r/-/data/vote/README.md
https://datahub.io/core/openml-datasets/_r/-/data/vote/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/vote/vote.arff
https://datahub.io/core/openml-datasets/_r/-/data/vote/vote.csv
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/README.md
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/wall-robot-navigation.arff
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/wall-robot-navigation.csv
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/README.md
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/waveform-5000.arff
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/waveform-5000.csv
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/wdbc.arff
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/wdbc.csv
https://datahub.io/core/openml-datasets/_r/-/data/yeast/README.md
https://datahub.io/core/openml-datasets/_r/-/data/yeast/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/yeast/yeast.arff
https://datahub.io/core/openml-datasets/_r/-/data/yeast/yeast.csv
https://datahub.io/core/openml-datasets/_r/-/datapackage.json
Key Files

Start with these files — they give you everything you need to understand and access the dataset.

https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/datapackage.json
README.mddocumentation
https://datahub.io/core/openml-datasets/_r/-/README.md
Typical Usage
  1. 1. Fetch data/Bioresponse/datapackage.json to inspect schema and resources
  2. 2. Download data resources listed in data/Bioresponse/datapackage.json
  3. 3. Read README.md for full context

Data Files

Explore with AI

gina_agnostic

Download

Download CSV

About

Last updated
9 February 2026
Total rows
...
Format
CSV
File size
8.53 MB

gina_agnostic

Unsupported data preview format `arff`

About

Last updated
9 February 2026
Format
ARFF
File size
8.54 MB

About this dataset

The resources for this dataset can be found at https://www.openml.org/d/1038

Author: Isabelle Guyon
Source: Agnostic Learning vs. Prior Knowledge Challenge
Please cite: None

Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which consisted of 5 different datasets (SYLVA, GINA, NOVA, HIVA, ADA). The purpose of the challenge was to check if the performance of domain-specific feature engineering (prior knowledge) can be met by algorithms that were trained on data without any domain-specific knowledge (agnostic). For the latter, the data was anonymised and preprocessed in a way that makes them uninterpretable.

Modified by TunedIT (converted to ARFF format)

Topic

The task of GINA is handwritten digit recognition. This is the agnostic version of a subset of the MNIST data set. We chose the problem of separating the odd numbers from even numbers. We use 2-digit numbers. Only the unit digit is informative for that task, therefore at least ½ of the features are distracters. This is a twoclass classification problem with sparse continuous input variables, in which each class is composed of several clusters. It is a problems with heterogeneous classes.

Source

The data set was constructed from the MNIST data that is made available by Yann LeCun of the NEC Research Institute at http://yann.lecun.com/exdb/mnist/. The digits have been size-normalized and centered in a fixed-size image of dimension 28x28. Examples are shown in the documentation in chapter 3.

Description

To construct the “agnostic” dataset, we performed the following steps:

  • We removed the pixels that were 99% of the time white. This reduced the original feature set of 784 pixels to 485.
  • The original resolution (256 gray levels) was kept.
  • In spite of the fact that the data are rather sparse (about 30% of the values are non-zero), we saved the data as a dense matrix because we found that it can be compressed better in this way (to 19 MB.)
  • The feature names are the (i,j) matrix coordinates of the pixels (in a 28x28 matrix.)
  • We created 2 digit numbers by dividing the datasets into to parts and pairing the digits at random.
  • The task is to separate odd from even numbers. The digit of the tens being not informative, the features of that digit act as distracters. To construct the “prior” dataset, we went back to the original data and fetched the “informative” digit in its original representation. Therefore, this data representation consists in a vector of concatenating the lines of a 28x28 pixel map.

Data type: non-sparse
Number of features: 970
Number of examples and check-sums:
Pos_ex Neg_ex Tot_ex Check_sum
Train 1550 1603 3153 164947945.00
Valid 155 160 315 16688946.00

This dataset contains samples from both training and validation datasets.