openml-datasets

API Access

Access dataset files directly from scripts, code, or AI agents.

Dataset Files

Each file has a stable URL (r-link) that you can use directly in scripts, apps, or AI agents. These URLs are permanent and safe to hardcode.

/core/openml-datasets/
https://datahub.io/core/openml-datasets/_r/-/FRESHNESS_CHECK.md
https://datahub.io/core/openml-datasets/_r/-/README.md
https://datahub.io/core/openml-datasets/_r/-/UPDATE_SCRIPT_MAINTENANCE_REPORT.md
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/Bioresponse.arff
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/Bioresponse.csv
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/Click_prediction_small.arff
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/Click_prediction_small.csv
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/IMDB.drama.arff
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/IMDB.drama.csv
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/README.md
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/MagicTelescope.arff
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/MagicTelescope.csv
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/README.md
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/Satellite.arff
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/Satellite.csv
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/README.md
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/SpeedDating.arff
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/SpeedDating.csv
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/abalone/README.md
https://datahub.io/core/openml-datasets/_r/-/data/abalone/abalone.arff
https://datahub.io/core/openml-datasets/_r/-/data/abalone/abalone.csv
https://datahub.io/core/openml-datasets/_r/-/data/abalone/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/adult/README.md
https://datahub.io/core/openml-datasets/_r/-/data/adult/adult.arff
https://datahub.io/core/openml-datasets/_r/-/data/adult/adult.csv
https://datahub.io/core/openml-datasets/_r/-/data/adult/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/airlines/README.md
https://datahub.io/core/openml-datasets/_r/-/data/airlines/airlines.arff
https://datahub.io/core/openml-datasets/_r/-/data/airlines/airlines.csv
https://datahub.io/core/openml-datasets/_r/-/data/airlines/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/README.md
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/amazon-commerce-reviews.arff
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/amazon-commerce-reviews.csv
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/anneal/README.md
https://datahub.io/core/openml-datasets/_r/-/data/anneal/anneal.arff
https://datahub.io/core/openml-datasets/_r/-/data/anneal/anneal.csv
https://datahub.io/core/openml-datasets/_r/-/data/anneal/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/README.md
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/arrhythmia.arff
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/arrhythmia.csv
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/README.md
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/autoUniv-au6-1000.arff
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/autoUniv-au6-1000.csv
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/autos/README.md
https://datahub.io/core/openml-datasets/_r/-/data/autos/autos.arff
https://datahub.io/core/openml-datasets/_r/-/data/autos/autos.csv
https://datahub.io/core/openml-datasets/_r/-/data/autos/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/README.md
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/bank-marketing.arff
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/bank-marketing.csv
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/README.md
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/banknote-authentication.arff
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/banknote-authentication.csv
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/README.md
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/blood-transfusion-service-center.arff
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/blood-transfusion-service-center.csv
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/breast-cancer.arff
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/breast-cancer.csv
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/README.md
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/breast-w.arff
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/breast-w.csv
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/README.md
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/cardiotocography.arff
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/cardiotocography.csv
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/README.md
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/climate-model-simulation-crashes.arff
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/climate-model-simulation-crashes.csv
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/cmc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/cmc/cmc.arff
https://datahub.io/core/openml-datasets/_r/-/data/cmc/cmc.csv
https://datahub.io/core/openml-datasets/_r/-/data/cmc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/README.md
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/credit-approval.arff
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/credit-approval.csv
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/README.md
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/credit-g.arff
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/credit-g.csv
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/README.md
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/diabetes.arff
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/diabetes.csv
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/README.md
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/eeg-eye-state.arff
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/eeg-eye-state.csv
https://datahub.io/core/openml-datasets/_r/-/data/electricity/README.md
https://datahub.io/core/openml-datasets/_r/-/data/electricity/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/electricity/electricity.arff
https://datahub.io/core/openml-datasets/_r/-/data/electricity/electricity.csv
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/fbis.wc.arff
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/fbis.wc.csv
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/README.md
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/first-order-theorem-proving.arff
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/first-order-theorem-proving.csv
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/gas-drift.arff
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/gas-drift.csv
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/gina_agnostic.arff
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/gina_agnostic.csv
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/gina_prior2.arff
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/gina_prior2.csv
https://datahub.io/core/openml-datasets/_r/-/data/glass/README.md
https://datahub.io/core/openml-datasets/_r/-/data/glass/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/glass/glass.arff
https://datahub.io/core/openml-datasets/_r/-/data/glass/glass.csv
https://datahub.io/core/openml-datasets/_r/-/data/haberman/README.md
https://datahub.io/core/openml-datasets/_r/-/data/haberman/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/haberman/haberman.arff
https://datahub.io/core/openml-datasets/_r/-/data/haberman/haberman.csv
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/README.md
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/heart-statlog.arff
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/heart-statlog.csv
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/README.md
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/hill-valley.arff
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/hill-valley.csv
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/README.md
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/ilpd.arff
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/ilpd.csv
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/README.md
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/ionosphere.arff
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/ionosphere.csv
https://datahub.io/core/openml-datasets/_r/-/data/iris/README.md
https://datahub.io/core/openml-datasets/_r/-/data/iris/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/iris/iris.arff
https://datahub.io/core/openml-datasets/_r/-/data/iris/iris.csv
https://datahub.io/core/openml-datasets/_r/-/data/isolet/README.md
https://datahub.io/core/openml-datasets/_r/-/data/isolet/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/isolet/isolet.arff
https://datahub.io/core/openml-datasets/_r/-/data/isolet/isolet.csv
https://datahub.io/core/openml-datasets/_r/-/data/jm1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/jm1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/jm1/jm1.arff
https://datahub.io/core/openml-datasets/_r/-/data/jm1/jm1.csv
https://datahub.io/core/openml-datasets/_r/-/data/kc1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kc1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kc1/kc1.arff
https://datahub.io/core/openml-datasets/_r/-/data/kc1/kc1.csv
https://datahub.io/core/openml-datasets/_r/-/data/kc2/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kc2/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kc2/kc2.arff
https://datahub.io/core/openml-datasets/_r/-/data/kc2/kc2.csv
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/kr-vs-kp.arff
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/kr-vs-kp.csv
https://datahub.io/core/openml-datasets/_r/-/data/kropt/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kropt/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kropt/kropt.arff
https://datahub.io/core/openml-datasets/_r/-/data/kropt/kropt.csv
https://datahub.io/core/openml-datasets/_r/-/data/letter/README.md
https://datahub.io/core/openml-datasets/_r/-/data/letter/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/letter/letter.arff
https://datahub.io/core/openml-datasets/_r/-/data/letter/letter.csv
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/README.md
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/liver-disorders.arff
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/liver-disorders.csv
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/lung-cancer.arff
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/lung-cancer.csv
https://datahub.io/core/openml-datasets/_r/-/data/lymph/README.md
https://datahub.io/core/openml-datasets/_r/-/data/lymph/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/lymph/lymph.arff
https://datahub.io/core/openml-datasets/_r/-/data/lymph/lymph.csv
https://datahub.io/core/openml-datasets/_r/-/data/madelon/README.md
https://datahub.io/core/openml-datasets/_r/-/data/madelon/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/madelon/madelon.arff
https://datahub.io/core/openml-datasets/_r/-/data/madelon/madelon.csv
https://datahub.io/core/openml-datasets/_r/-/data/mammography/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mammography/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mammography/mammography.arff
https://datahub.io/core/openml-datasets/_r/-/data/mammography/mammography.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/mfeat-factors.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/mfeat-factors.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/mfeat-karhunen.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/mfeat-karhunen.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/mfeat-morphological.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/mfeat-morphological.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/mfeat-pixel.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/mfeat-pixel.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/mfeat-zernike.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/mfeat-zernike.csv
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/mozilla4.arff
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/mozilla4.csv
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/mushroom.arff
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/mushroom.csv
https://datahub.io/core/openml-datasets/_r/-/data/musk/README.md
https://datahub.io/core/openml-datasets/_r/-/data/musk/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/musk/musk.arff
https://datahub.io/core/openml-datasets/_r/-/data/musk/musk.csv
https://datahub.io/core/openml-datasets/_r/-/data/nursery/README.md
https://datahub.io/core/openml-datasets/_r/-/data/nursery/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/nursery/nursery.arff
https://datahub.io/core/openml-datasets/_r/-/data/nursery/nursery.csv
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/README.md
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/oil_spill.arff
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/oil_spill.csv
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/README.md
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/one-hundred-plants-shape.arff
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/one-hundred-plants-shape.csv
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/README.md
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/one-hundred-plants-texture.arff
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/one-hundred-plants-texture.csv
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/README.md
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/optdigits.arff
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/optdigits.csv
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/README.md
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/page-blocks.arff
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/page-blocks.csv
https://datahub.io/core/openml-datasets/_r/-/data/pc1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pc1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pc1/pc1.arff
https://datahub.io/core/openml-datasets/_r/-/data/pc1/pc1.csv
https://datahub.io/core/openml-datasets/_r/-/data/pc3/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pc3/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pc3/pc3.arff
https://datahub.io/core/openml-datasets/_r/-/data/pc3/pc3.csv
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/pendigits.arff
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/pendigits.csv
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/README.md
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/phoneme.arff
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/phoneme.csv
https://datahub.io/core/openml-datasets/_r/-/data/profb/README.md
https://datahub.io/core/openml-datasets/_r/-/data/profb/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/profb/profb.arff
https://datahub.io/core/openml-datasets/_r/-/data/profb/profb.csv
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/README.md
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/qsar-biodeg.arff
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/qsar-biodeg.csv
https://datahub.io/core/openml-datasets/_r/-/data/satimage/README.md
https://datahub.io/core/openml-datasets/_r/-/data/satimage/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/satimage/satimage.arff
https://datahub.io/core/openml-datasets/_r/-/data/satimage/satimage.csv
https://datahub.io/core/openml-datasets/_r/-/data/scene/README.md
https://datahub.io/core/openml-datasets/_r/-/data/scene/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/scene/scene.arff
https://datahub.io/core/openml-datasets/_r/-/data/scene/scene.csv
https://datahub.io/core/openml-datasets/_r/-/data/segment/README.md
https://datahub.io/core/openml-datasets/_r/-/data/segment/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/segment/segment.arff
https://datahub.io/core/openml-datasets/_r/-/data/segment/segment.csv
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/README.md
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/seismic-bumps.arff
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/seismic-bumps.csv
https://datahub.io/core/openml-datasets/_r/-/data/semeion/README.md
https://datahub.io/core/openml-datasets/_r/-/data/semeion/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/semeion/semeion.arff
https://datahub.io/core/openml-datasets/_r/-/data/semeion/semeion.csv
https://datahub.io/core/openml-datasets/_r/-/data/sick/README.md
https://datahub.io/core/openml-datasets/_r/-/data/sick/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/sick/sick.arff
https://datahub.io/core/openml-datasets/_r/-/data/sick/sick.csv
https://datahub.io/core/openml-datasets/_r/-/data/sonar/README.md
https://datahub.io/core/openml-datasets/_r/-/data/sonar/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/sonar/sonar.arff
https://datahub.io/core/openml-datasets/_r/-/data/sonar/sonar.csv
https://datahub.io/core/openml-datasets/_r/-/data/soybean/README.md
https://datahub.io/core/openml-datasets/_r/-/data/soybean/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/soybean/soybean.arff
https://datahub.io/core/openml-datasets/_r/-/data/soybean/soybean.csv
https://datahub.io/core/openml-datasets/_r/-/data/spambase/README.md
https://datahub.io/core/openml-datasets/_r/-/data/spambase/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/spambase/spambase.arff
https://datahub.io/core/openml-datasets/_r/-/data/spambase/spambase.csv
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/spectrometer.arff
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/spectrometer.csv
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/README.md
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/steel-plates-fault.arff
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/steel-plates-fault.csv
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/README.md
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/tic-tac-toe.arff
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/tic-tac-toe.csv
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/README.md
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/vehicle.arff
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/vehicle.csv
https://datahub.io/core/openml-datasets/_r/-/data/vote/README.md
https://datahub.io/core/openml-datasets/_r/-/data/vote/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/vote/vote.arff
https://datahub.io/core/openml-datasets/_r/-/data/vote/vote.csv
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/README.md
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/wall-robot-navigation.arff
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/wall-robot-navigation.csv
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/README.md
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/waveform-5000.arff
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/waveform-5000.csv
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/wdbc.arff
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/wdbc.csv
https://datahub.io/core/openml-datasets/_r/-/data/yeast/README.md
https://datahub.io/core/openml-datasets/_r/-/data/yeast/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/yeast/yeast.arff
https://datahub.io/core/openml-datasets/_r/-/data/yeast/yeast.csv
https://datahub.io/core/openml-datasets/_r/-/datapackage.json
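The listing above follows a regular pattern: each dataset folder data/<name>/ holds <name>.arff, <name>.csv, README.md, and datapackage.json. A minimal Python sketch for building those URLs, assuming the pattern holds for every dataset (the helper name dataset_urls is hypothetical and not part of any DataHub API):

```python
from urllib.parse import quote

BASE = "https://datahub.io/core/openml-datasets/_r/-"


def dataset_urls(name: str) -> dict[str, str]:
    """Build the four r-link URLs for one dataset folder (hypothetical helper)."""
    safe = quote(name)  # dataset names such as "IMDB.drama" pass through unchanged
    folder = f"{BASE}/data/{safe}"
    return {
        "arff": f"{folder}/{safe}.arff",
        "csv": f"{folder}/{safe}.csv",
        "readme": f"{folder}/README.md",
        "datapackage": f"{folder}/datapackage.json",
    }


urls = dataset_urls("sonar")
print(urls["csv"])
# https://datahub.io/core/openml-datasets/_r/-/data/sonar/sonar.csv
```

Names are taken verbatim from the listing, so case matters: "Satellite" and "satimage" are different folders.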
Key Files

Start with these files — they give you everything you need to understand and access the dataset.

https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/datapackage.json
README.md (documentation)
https://datahub.io/core/openml-datasets/_r/-/README.md
Typical Usage
  1. Fetch data/Bioresponse/datapackage.json to inspect schema and resources
  2. Download data resources listed in data/Bioresponse/datapackage.json
  3. Read README.md for full context
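
The steps above can be sketched in Python. The sketch assumes datapackage.json follows the Frictionless Data Package layout, with a top-level "resources" list whose entries carry a relative "path" string. The real descriptor is not shown on this page, so SAMPLE below is a made-up stand-in and resource_urls is a hypothetical helper.

```python
import json
from urllib.parse import urljoin

DESCRIPTOR_URL = (
    "https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/datapackage.json"
)

# Made-up stand-in for the descriptor; the field values are assumptions.
SAMPLE = json.loads("""
{"name": "bioresponse",
 "resources": [
   {"name": "bioresponse-csv", "path": "Bioresponse.csv"},
   {"name": "bioresponse-arff", "path": "Bioresponse.arff"}
 ]}
""")


def resource_urls(descriptor: dict, descriptor_url: str) -> list[str]:
    """Resolve each resource path relative to the descriptor's own URL."""
    return [
        urljoin(descriptor_url, resource["path"])
        for resource in descriptor.get("resources", [])
        if "path" in resource
    ]


for url in resource_urls(SAMPLE, DESCRIPTOR_URL):
    print(url)

# To use the live file instead of SAMPLE (needs network access):
#   from urllib.request import urlopen
#   with urlopen(DESCRIPTOR_URL) as resp:
#       descriptor = json.load(resp)
```

Resolving paths against the descriptor URL keeps the code independent of the folder layout, provided the paths in the descriptor are relative.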

Data Previews

sonar

Schema

name | type | format
attribute_1 | number | default
attribute_2 | number | default
attribute_3 | number | default
attribute_4 | number | default
attribute_5 | number | default
attribute_6 | number | default
attribute_7 | number | default
attribute_8 | number | default
attribute_9 | number | default
attribute_10 | number | default
attribute_11 | number | default
attribute_12 | number | default
attribute_13 | number | default
attribute_14 | number | default
attribute_15 | number | default
attribute_16 | number | default
attribute_17 | number | default
attribute_18 | number | default
attribute_19 | number | default
attribute_20 | number | default
attribute_21 | number | default
attribute_22 | number | default
attribute_23 | number | default
attribute_24 | number | default
attribute_25 | number | default
attribute_26 | number | default
attribute_27 | number | default
attribute_28 | number | default
attribute_29 | number | default
attribute_30 | number | default
attribute_31 | number | default
attribute_32 | number | default
attribute_33 | number | default
attribute_34 | number | default
attribute_35 | number | default
attribute_36 | number | default
attribute_37 | number | default
attribute_38 | number | default
attribute_39 | number | default
attribute_40 | number | default
attribute_41 | number | default
attribute_42 | number | default
attribute_43 | number | default
attribute_44 | number | default
attribute_45 | number | default
attribute_46 | number | default
attribute_47 | number | default
attribute_48 | number | default
attribute_49 | number | default
attribute_50 | number | default
attribute_51 | number | default
attribute_52 | number | default
attribute_53 | number | default
attribute_54 | number | default
attribute_55 | number | default
attribute_56 | number | default
attribute_57 | number | default
attribute_58 | number | default
attribute_59 | number | default
attribute_60 | number | default
Class | string | default

Data Files

File | Description | Size | Last modified | Download
sonar | | 94.7 kB | 3 months ago | sonar
sonar | | 87.3 kB | 3 months ago | sonar

Files | Size | Format | Created | Updated | License | Source
2 | 182 kB | arff, csv | | about 2 months ago | Open Data Commons Public Domain Dedication and License |

The resources for this dataset can be found at https://www.openml.org/d/40

Author:
Source: Unknown
Please cite:

NAME: Sonar, Mines vs. Rocks

SUMMARY: This is the data set used by Gorman and Sejnowski in their study of the classification of sonar signals using a neural network [1]. The task is to train a network to discriminate between sonar signals bounced off a metal cylinder and those bounced off a roughly cylindrical rock.

SOURCE: The data set was contributed to the benchmark collection by Terry Sejnowski, now at the Salk Institute and the University of California at San Diego. The data set was developed in collaboration with R. Paul Gorman of Allied-Signal Aerospace Technology Center.

MAINTAINER: Scott E. Fahlman

PROBLEM DESCRIPTION:

The file "sonar.mines" contains 111 patterns obtained by bouncing sonar signals off a metal cylinder at various angles and under various conditions. The file "sonar.rocks" contains 97 patterns obtained from rocks under similar conditions. The transmitted sonar signal is a frequency-modulated chirp, rising in frequency. The data set contains signals obtained from a variety of different aspect angles, spanning 90 degrees for the cylinder and 180 degrees for the rock.

Each pattern is a set of 60 numbers in the range 0.0 to 1.0. Each number represents the energy within a particular frequency band, integrated over a certain period of time. The integration aperture for higher frequencies occurs later in time, since these frequencies are transmitted later during the chirp.

The label associated with each record contains the letter "R" if the object is a rock and "M" if it is a mine (metal cylinder). The numbers in the labels are in increasing order of aspect angle, but they do not encode the angle directly.
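
A minimal Python sketch of reading such records, assuming sonar.csv has a header row with the column names from the schema listed earlier (attribute_1 to attribute_60, then Class). Whether the CSV stores the label as a single letter or spelled out is not stated here, so the sketch only looks at its first letter. The function parse_sonar and the two sample rows are hypothetical.

```python
import csv
import io

N_FEATURES = 60


def parse_sonar(text: str) -> tuple[list[list[float]], list[str]]:
    """Split CSV text into 60-value feature rows and 'R'/'M' labels."""
    features, labels = [], []
    for row in csv.DictReader(io.StringIO(text)):
        features.append(
            [float(row[f"attribute_{i}"]) for i in range(1, N_FEATURES + 1)]
        )
        # Keep only the first letter so "R", "Rock", "M" and "Mine" all work.
        labels.append(row["Class"].strip()[0].upper())
    return features, labels


# Two synthetic rows (not real sonar data), just to exercise the parser.
header = ",".join([f"attribute_{i}" for i in range(1, N_FEATURES + 1)] + ["Class"])
sample = "\n".join([
    header,
    ",".join(["0.5"] * N_FEATURES + ["Rock"]),
    ",".join(["0.25"] * N_FEATURES + ["M"]),
])

X, y = parse_sonar(sample)
print(len(X), len(X[0]), y)  # 2 60 ['R', 'M']
```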

METHODOLOGY:

This data set can be used in a number of different ways to test learning speed, quality of ultimate learning, ability to generalize, or combinations of these factors.

In [1], Gorman and Sejnowski report two series of experiments: an "aspect-angle independent" series, in which the whole data set is used without controlling for aspect angle, and an "aspect-angle dependent" series in which the training and testing sets were carefully controlled to ensure that each set contained cases from each aspect angle in appropriate proportions.

For the aspect-angle independent experiments the combined set of 208 cases is divided randomly into 13 disjoint sets with 16 cases in each. For each experiment, 12 of these sets are used as training data, while the 13th is reserved for testing. The experiment is repeated 13 times so that every case appears once as part of a test set. The reported performance is an average over the entire set of 13 different test sets, each run 10 times.
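The 13-way division can be sketched as follows. This is an illustration of the splitting scheme only, under the assumption that cases are addressed by index 0 to 207; the function name `make_folds` and the fixed seed are hypothetical choices, not taken from [1].

```python
import random
from typing import List, Tuple


def make_folds(n_cases: int = 208, n_folds: int = 13, seed: int = 0) -> List[List[int]]:
    """Randomly partition case indices into equal-sized disjoint folds."""
    if n_cases % n_folds != 0:
        raise ValueError("n_cases must be divisible by n_folds")
    indices = list(range(n_cases))
    random.Random(seed).shuffle(indices)
    size = n_cases // n_folds  # 208 / 13 = 16 cases per fold
    return [indices[i * size:(i + 1) * size] for i in range(n_folds)]


folds = make_folds()

# One experiment per fold: hold that fold out, train on the other 12.
splits: List[Tuple[List[int], List[int]]] = []
for held_out in range(len(folds)):
    test_idx = folds[held_out]
    train_idx = [i for k, fold in enumerate(folds) if k != held_out for i in fold]
    splits.append((train_idx, test_idx))
```

Each of the 13 `(train_idx, test_idx)` pairs holds 192 training cases and 16 test cases, so every case is tested exactly once.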

It was observed that this random division of the sample set led to rather uneven performance. A few of the splits gave poor results, presumably because the test set contains some samples from aspect angles that are under-represented in the corresponding training set. This motivated Gorman and Sejnowski to devise a different set of experiments in which an attempt was made to balance the training and test sets so that each would have a representative number of samples from all aspect angles. Since detailed aspect angle information was not present in the database of samples, the 208 samples were first divided into clusters, using a 60-dimensional Euclidean metric; each of these clusters was then divided between the 104-member training set and the 104-member test set.
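The idea of dividing each cluster between the two sets can be sketched as below. The description does not say which clustering procedure was used or how each cluster was divided, so this sketch takes cluster assignments as a given input and alternates members between the two sets; `split_by_cluster`, the alternation rule, and the synthetic cluster labels are all assumptions made for illustration.

```python
import math
import random
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two equal-length patterns."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def split_by_cluster(cluster_ids: Sequence[int], seed: int = 0) -> Tuple[List[int], List[int]]:
    """Divide every cluster as evenly as possible between train and test."""
    members: Dict[int, List[int]] = defaultdict(list)
    for idx, cid in enumerate(cluster_ids):
        members[cid].append(idx)

    rng = random.Random(seed)
    train: List[int] = []
    test: List[int] = []
    turn = 0  # carried across clusters so the overall split stays even
    for cid in sorted(members):
        group = members[cid]
        rng.shuffle(group)
        for idx in group:
            (train if turn % 2 == 0 else test).append(idx)
            turn += 1
    return train, test


# Synthetic cluster labels: 208 samples spread over 8 made-up clusters.
cluster_ids = [i % 8 for i in range(208)]
train_idx, test_idx = split_by_cluster(cluster_ids)
```

Carrying the alternation counter across clusters keeps the totals at 104 and 104 even when individual clusters have an odd number of members.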

The actual training and testing samples used for the "aspect-angle dependent" experiments are marked in the data files. The reported performance is an average over 10 runs with this single division of the data set.

A standard back-propagation network was used for all experiments. The network had 60 inputs and 2 output units, one indicating a cylinder and the other a rock. Experiments were run with no hidden units (direct connections from each input to each output) and with a single hidden layer with 2, 3, 6, 12, or 24 units. Each network was trained for 300 epochs over the entire training set.

The weight-update formulas used in this study were slightly different from the standard form. A learning rate of 2.0 and momentum of 0.0 were used. Errors less than 0.2 were treated as zero. Initial weights were uniform random values in the range -0.3 to +0.3.
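The network shape, the weight initialization, and the error tolerance can be sketched in a few lines. This is not the authors' implementation: the modified weight-update formulas are not given here, so no update step is shown, and the sigmoid activation, the absence of bias terms, and all function names are assumptions of this sketch.

```python
import math
import random
from typing import List

Matrix = List[List[float]]


def init_weights(n_in: int, n_out: int, rng: random.Random) -> Matrix:
    """Uniform random weights in [-0.3, 0.3], one row per receiving unit."""
    return [[rng.uniform(-0.3, 0.3) for _ in range(n_in)] for _ in range(n_out)]


def sigmoid(z: float) -> float:
    # Assumed activation; the description only says "standard back-propagation".
    return 1.0 / (1.0 + math.exp(-z))


def layer(weights: Matrix, inputs: List[float]) -> List[float]:
    """Forward pass through one fully connected layer (no bias terms here)."""
    return [sigmoid(sum(w * x for w, x in zip(row, inputs))) for row in weights]


def thresholded_error(target: float, output: float, tol: float = 0.2) -> float:
    """Treat errors smaller than the tolerance as zero."""
    err = target - output
    return 0.0 if abs(err) < tol else err


rng = random.Random(0)
n_hidden = 12  # one of the hidden-layer sizes listed above
w_hidden = init_weights(60, n_hidden, rng)
w_out = init_weights(n_hidden, 2, rng)

pattern = [0.5] * 60  # synthetic input, not a real sonar return
outputs = layer(w_out, layer(w_hidden, pattern))
errors = [thresholded_error(t, o) for t, o in zip([1.0, 0.0], outputs)]
```

For the configuration with no hidden units, a single `init_weights(60, 2, rng)` layer would connect the inputs directly to the two outputs.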

RESULTS:

For the angle-independent experiments, Gorman and Sejnowski report the following results for networks with different numbers of hidden units:

Hidden   % Right on     Std.   % Right on   Std.
Units    Training Set   Dev.   Test Set     Dev.
------   ------------   ----   ----------   ----
0        89.4           2.1    77.1         8.3
2        96.5           0.7    81.9         6.2
3        98.8           0.4    82.0         7.3
6        99.7           0.2    83.5         5.6
12       99.8           0.1    84.7         5.7
24       99.8           0.1    84.5         5.7

For the angle-dependent experiments Gorman and Sejnowski report the following results:

Hidden   % Right on     Std.   % Right on   Std.
Units    Training Set   Dev.   Test Set     Dev.
------   ------------   ----   ----------   ----
0        79.3           3.4    73.1         4.8
2        96.2           2.2    85.7         6.3
3        98.1           1.5    87.6         3.0
6        99.4           0.9    89.3         2.4
12       99.8           0.6    90.4         1.8
24       100.0          0.0    89.2         1.4

Not surprisingly, the network's performance on the test set was somewhat better when the aspect angles in the training and test sets were balanced.
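The size of that difference can be read off the two result tables directly. The short sketch below subtracts the angle-independent test accuracy from the angle-dependent one for each network size; the numbers are copied from the tables above, and the variable names are illustrative.

```python
HIDDEN_UNITS = [0, 2, 3, 6, 12, 24]

# "% Right on Test Set" columns from the two tables above.
INDEPENDENT_TEST = [77.1, 81.9, 82.0, 83.5, 84.7, 84.5]
DEPENDENT_TEST = [73.1, 85.7, 87.6, 89.3, 90.4, 89.2]

# Gain in percentage points from balancing aspect angles, per network size.
gains = {
    h: round(dep - ind, 1)
    for h, ind, dep in zip(HIDDEN_UNITS, INDEPENDENT_TEST, DEPENDENT_TEST)
}

for h, gain in gains.items():
    print(f"{h:>2} hidden units: {gain:+.1f} points")
```

In these figures the gain appears for every network with hidden units, while the network with no hidden units scored lower on the balanced split (73.1 versus 77.1).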

Gorman and Sejnowski further report that a nearest neighbor classifier on the same data gave an 82.7% probability of correct classification.
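For comparison, a nearest neighbor classifier of this kind takes only a few lines. This is a generic 1-nearest-neighbor sketch, assuming Euclidean distance (the distance measure of the reported baseline is not stated here); the toy two-dimensional samples are synthetic and stand in for the 60-dimensional patterns.

```python
import math
from typing import List, Sequence, Tuple

Sample = Tuple[Sequence[float], str]


def distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two equal-length patterns."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def predict_1nn(train: Sequence[Sample], query: Sequence[float]) -> str:
    """Return the label of the training pattern closest to the query."""
    if not train:
        raise ValueError("training set is empty")
    _, label = min(train, key=lambda sample: distance(sample[0], query))
    return label


def accuracy(train: Sequence[Sample], test: Sequence[Sample]) -> float:
    """Fraction of test patterns whose nearest neighbor has the right label."""
    hits = sum(1 for features, label in test if predict_1nn(train, features) == label)
    return hits / len(test)


# Synthetic toy data, for illustration only.
train: List[Sample] = [([0.1, 0.2], "Rock"), ([0.9, 0.8], "Mine")]
test: List[Sample] = [([0.2, 0.1], "Rock"), ([0.8, 0.9], "Mine")]
score = accuracy(train, test)
```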

Three trained human subjects were each tested on 100 signals, chosen at random from the set of 208 returns used to create this data set. Their responses ranged between 88% and 97% correct. However, they may have been using information from the raw sonar signal that is not preserved in the processed data sets presented here.

REFERENCES:

  1. Gorman, R. P., and Sejnowski, T. J. (1988). "Analysis of Hidden Units in a Layered Network Trained to Classify Sonar Targets" in Neural Networks, Vol. 1, pp. 75-89.

Relabeled values in attribute 'Class':
From: R To: Rock
From: M To: Mine