openml-datasets

API Access

Access dataset files directly from scripts, code, or AI agents.

Dataset Files

Each file has a stable URL (r-link) that you can use directly in scripts, apps, or AI agents. These URLs are permanent and safe to hardcode.

/core/openml-datasets/
https://datahub.io/core/openml-datasets/_r/-/FRESHNESS_CHECK.md
https://datahub.io/core/openml-datasets/_r/-/README.md
https://datahub.io/core/openml-datasets/_r/-/UPDATE_SCRIPT_MAINTENANCE_REPORT.md
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/Bioresponse.arff
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/Bioresponse.csv
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/Click_prediction_small.arff
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/Click_prediction_small.csv
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Click_prediction_small/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/IMDB.drama.arff
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/IMDB.drama.csv
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/README.md
https://datahub.io/core/openml-datasets/_r/-/data/IMDB.drama/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/MagicTelescope.arff
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/MagicTelescope.csv
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/README.md
https://datahub.io/core/openml-datasets/_r/-/data/MagicTelescope/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/README.md
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/Satellite.arff
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/Satellite.csv
https://datahub.io/core/openml-datasets/_r/-/data/Satellite/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/README.md
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/SpeedDating.arff
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/SpeedDating.csv
https://datahub.io/core/openml-datasets/_r/-/data/SpeedDating/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/abalone/README.md
https://datahub.io/core/openml-datasets/_r/-/data/abalone/abalone.arff
https://datahub.io/core/openml-datasets/_r/-/data/abalone/abalone.csv
https://datahub.io/core/openml-datasets/_r/-/data/abalone/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/adult/README.md
https://datahub.io/core/openml-datasets/_r/-/data/adult/adult.arff
https://datahub.io/core/openml-datasets/_r/-/data/adult/adult.csv
https://datahub.io/core/openml-datasets/_r/-/data/adult/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/airlines/README.md
https://datahub.io/core/openml-datasets/_r/-/data/airlines/airlines.arff
https://datahub.io/core/openml-datasets/_r/-/data/airlines/airlines.csv
https://datahub.io/core/openml-datasets/_r/-/data/airlines/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/README.md
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/amazon-commerce-reviews.arff
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/amazon-commerce-reviews.csv
https://datahub.io/core/openml-datasets/_r/-/data/amazon-commerce-reviews/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/anneal/README.md
https://datahub.io/core/openml-datasets/_r/-/data/anneal/anneal.arff
https://datahub.io/core/openml-datasets/_r/-/data/anneal/anneal.csv
https://datahub.io/core/openml-datasets/_r/-/data/anneal/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/README.md
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/arrhythmia.arff
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/arrhythmia.csv
https://datahub.io/core/openml-datasets/_r/-/data/arrhythmia/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/README.md
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/autoUniv-au6-1000.arff
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/autoUniv-au6-1000.csv
https://datahub.io/core/openml-datasets/_r/-/data/autoUniv-au6-1000/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/autos/README.md
https://datahub.io/core/openml-datasets/_r/-/data/autos/autos.arff
https://datahub.io/core/openml-datasets/_r/-/data/autos/autos.csv
https://datahub.io/core/openml-datasets/_r/-/data/autos/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/README.md
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/bank-marketing.arff
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/bank-marketing.csv
https://datahub.io/core/openml-datasets/_r/-/data/bank-marketing/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/README.md
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/banknote-authentication.arff
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/banknote-authentication.csv
https://datahub.io/core/openml-datasets/_r/-/data/banknote-authentication/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/README.md
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/blood-transfusion-service-center.arff
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/blood-transfusion-service-center.csv
https://datahub.io/core/openml-datasets/_r/-/data/blood-transfusion-service-center/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/breast-cancer.arff
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/breast-cancer.csv
https://datahub.io/core/openml-datasets/_r/-/data/breast-cancer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/README.md
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/breast-w.arff
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/breast-w.csv
https://datahub.io/core/openml-datasets/_r/-/data/breast-w/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/README.md
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/cardiotocography.arff
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/cardiotocography.csv
https://datahub.io/core/openml-datasets/_r/-/data/cardiotocography/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/README.md
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/climate-model-simulation-crashes.arff
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/climate-model-simulation-crashes.csv
https://datahub.io/core/openml-datasets/_r/-/data/climate-model-simulation-crashes/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/cmc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/cmc/cmc.arff
https://datahub.io/core/openml-datasets/_r/-/data/cmc/cmc.csv
https://datahub.io/core/openml-datasets/_r/-/data/cmc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/README.md
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/credit-approval.arff
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/credit-approval.csv
https://datahub.io/core/openml-datasets/_r/-/data/credit-approval/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/README.md
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/credit-g.arff
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/credit-g.csv
https://datahub.io/core/openml-datasets/_r/-/data/credit-g/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/README.md
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/diabetes.arff
https://datahub.io/core/openml-datasets/_r/-/data/diabetes/diabetes.csv
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/README.md
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/eeg-eye-state.arff
https://datahub.io/core/openml-datasets/_r/-/data/eeg-eye-state/eeg-eye-state.csv
https://datahub.io/core/openml-datasets/_r/-/data/electricity/README.md
https://datahub.io/core/openml-datasets/_r/-/data/electricity/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/electricity/electricity.arff
https://datahub.io/core/openml-datasets/_r/-/data/electricity/electricity.csv
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/fbis.wc.arff
https://datahub.io/core/openml-datasets/_r/-/data/fbis.wc/fbis.wc.csv
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/README.md
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/first-order-theorem-proving.arff
https://datahub.io/core/openml-datasets/_r/-/data/first-order-theorem-proving/first-order-theorem-proving.csv
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/gas-drift.arff
https://datahub.io/core/openml-datasets/_r/-/data/gas-drift/gas-drift.csv
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/gina_agnostic.arff
https://datahub.io/core/openml-datasets/_r/-/data/gina_agnostic/gina_agnostic.csv
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/README.md
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/gina_prior2.arff
https://datahub.io/core/openml-datasets/_r/-/data/gina_prior2/gina_prior2.csv
https://datahub.io/core/openml-datasets/_r/-/data/glass/README.md
https://datahub.io/core/openml-datasets/_r/-/data/glass/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/glass/glass.arff
https://datahub.io/core/openml-datasets/_r/-/data/glass/glass.csv
https://datahub.io/core/openml-datasets/_r/-/data/haberman/README.md
https://datahub.io/core/openml-datasets/_r/-/data/haberman/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/haberman/haberman.arff
https://datahub.io/core/openml-datasets/_r/-/data/haberman/haberman.csv
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/README.md
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/heart-statlog.arff
https://datahub.io/core/openml-datasets/_r/-/data/heart-statlog/heart-statlog.csv
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/README.md
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/hill-valley.arff
https://datahub.io/core/openml-datasets/_r/-/data/hill-valley/hill-valley.csv
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/README.md
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/ilpd.arff
https://datahub.io/core/openml-datasets/_r/-/data/ilpd/ilpd.csv
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/README.md
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/ionosphere.arff
https://datahub.io/core/openml-datasets/_r/-/data/ionosphere/ionosphere.csv
https://datahub.io/core/openml-datasets/_r/-/data/iris/README.md
https://datahub.io/core/openml-datasets/_r/-/data/iris/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/iris/iris.arff
https://datahub.io/core/openml-datasets/_r/-/data/iris/iris.csv
https://datahub.io/core/openml-datasets/_r/-/data/isolet/README.md
https://datahub.io/core/openml-datasets/_r/-/data/isolet/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/isolet/isolet.arff
https://datahub.io/core/openml-datasets/_r/-/data/isolet/isolet.csv
https://datahub.io/core/openml-datasets/_r/-/data/jm1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/jm1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/jm1/jm1.arff
https://datahub.io/core/openml-datasets/_r/-/data/jm1/jm1.csv
https://datahub.io/core/openml-datasets/_r/-/data/kc1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kc1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kc1/kc1.arff
https://datahub.io/core/openml-datasets/_r/-/data/kc1/kc1.csv
https://datahub.io/core/openml-datasets/_r/-/data/kc2/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kc2/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kc2/kc2.arff
https://datahub.io/core/openml-datasets/_r/-/data/kc2/kc2.csv
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/kr-vs-kp.arff
https://datahub.io/core/openml-datasets/_r/-/data/kr-vs-kp/kr-vs-kp.csv
https://datahub.io/core/openml-datasets/_r/-/data/kropt/README.md
https://datahub.io/core/openml-datasets/_r/-/data/kropt/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/kropt/kropt.arff
https://datahub.io/core/openml-datasets/_r/-/data/kropt/kropt.csv
https://datahub.io/core/openml-datasets/_r/-/data/letter/README.md
https://datahub.io/core/openml-datasets/_r/-/data/letter/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/letter/letter.arff
https://datahub.io/core/openml-datasets/_r/-/data/letter/letter.csv
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/README.md
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/liver-disorders.arff
https://datahub.io/core/openml-datasets/_r/-/data/liver-disorders/liver-disorders.csv
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/lung-cancer.arff
https://datahub.io/core/openml-datasets/_r/-/data/lung-cancer/lung-cancer.csv
https://datahub.io/core/openml-datasets/_r/-/data/lymph/README.md
https://datahub.io/core/openml-datasets/_r/-/data/lymph/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/lymph/lymph.arff
https://datahub.io/core/openml-datasets/_r/-/data/lymph/lymph.csv
https://datahub.io/core/openml-datasets/_r/-/data/madelon/README.md
https://datahub.io/core/openml-datasets/_r/-/data/madelon/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/madelon/madelon.arff
https://datahub.io/core/openml-datasets/_r/-/data/madelon/madelon.csv
https://datahub.io/core/openml-datasets/_r/-/data/mammography/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mammography/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mammography/mammography.arff
https://datahub.io/core/openml-datasets/_r/-/data/mammography/mammography.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/mfeat-factors.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-factors/mfeat-factors.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/mfeat-karhunen.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-karhunen/mfeat-karhunen.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/mfeat-morphological.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-morphological/mfeat-morphological.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/mfeat-pixel.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-pixel/mfeat-pixel.csv
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/mfeat-zernike.arff
https://datahub.io/core/openml-datasets/_r/-/data/mfeat-zernike/mfeat-zernike.csv
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/mozilla4.arff
https://datahub.io/core/openml-datasets/_r/-/data/mozilla4/mozilla4.csv
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/README.md
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/mushroom.arff
https://datahub.io/core/openml-datasets/_r/-/data/mushroom/mushroom.csv
https://datahub.io/core/openml-datasets/_r/-/data/musk/README.md
https://datahub.io/core/openml-datasets/_r/-/data/musk/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/musk/musk.arff
https://datahub.io/core/openml-datasets/_r/-/data/musk/musk.csv
https://datahub.io/core/openml-datasets/_r/-/data/nursery/README.md
https://datahub.io/core/openml-datasets/_r/-/data/nursery/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/nursery/nursery.arff
https://datahub.io/core/openml-datasets/_r/-/data/nursery/nursery.csv
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/README.md
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/oil_spill.arff
https://datahub.io/core/openml-datasets/_r/-/data/oil_spill/oil_spill.csv
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/README.md
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/one-hundred-plants-shape.arff
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-shape/one-hundred-plants-shape.csv
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/README.md
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/one-hundred-plants-texture.arff
https://datahub.io/core/openml-datasets/_r/-/data/one-hundred-plants-texture/one-hundred-plants-texture.csv
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/README.md
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/optdigits.arff
https://datahub.io/core/openml-datasets/_r/-/data/optdigits/optdigits.csv
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/README.md
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/page-blocks.arff
https://datahub.io/core/openml-datasets/_r/-/data/page-blocks/page-blocks.csv
https://datahub.io/core/openml-datasets/_r/-/data/pc1/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pc1/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pc1/pc1.arff
https://datahub.io/core/openml-datasets/_r/-/data/pc1/pc1.csv
https://datahub.io/core/openml-datasets/_r/-/data/pc3/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pc3/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pc3/pc3.arff
https://datahub.io/core/openml-datasets/_r/-/data/pc3/pc3.csv
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/README.md
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/pendigits.arff
https://datahub.io/core/openml-datasets/_r/-/data/pendigits/pendigits.csv
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/README.md
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/phoneme.arff
https://datahub.io/core/openml-datasets/_r/-/data/phoneme/phoneme.csv
https://datahub.io/core/openml-datasets/_r/-/data/profb/README.md
https://datahub.io/core/openml-datasets/_r/-/data/profb/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/profb/profb.arff
https://datahub.io/core/openml-datasets/_r/-/data/profb/profb.csv
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/README.md
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/qsar-biodeg.arff
https://datahub.io/core/openml-datasets/_r/-/data/qsar-biodeg/qsar-biodeg.csv
https://datahub.io/core/openml-datasets/_r/-/data/satimage/README.md
https://datahub.io/core/openml-datasets/_r/-/data/satimage/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/satimage/satimage.arff
https://datahub.io/core/openml-datasets/_r/-/data/satimage/satimage.csv
https://datahub.io/core/openml-datasets/_r/-/data/scene/README.md
https://datahub.io/core/openml-datasets/_r/-/data/scene/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/scene/scene.arff
https://datahub.io/core/openml-datasets/_r/-/data/scene/scene.csv
https://datahub.io/core/openml-datasets/_r/-/data/segment/README.md
https://datahub.io/core/openml-datasets/_r/-/data/segment/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/segment/segment.arff
https://datahub.io/core/openml-datasets/_r/-/data/segment/segment.csv
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/README.md
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/seismic-bumps.arff
https://datahub.io/core/openml-datasets/_r/-/data/seismic-bumps/seismic-bumps.csv
https://datahub.io/core/openml-datasets/_r/-/data/semeion/README.md
https://datahub.io/core/openml-datasets/_r/-/data/semeion/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/semeion/semeion.arff
https://datahub.io/core/openml-datasets/_r/-/data/semeion/semeion.csv
https://datahub.io/core/openml-datasets/_r/-/data/sick/README.md
https://datahub.io/core/openml-datasets/_r/-/data/sick/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/sick/sick.arff
https://datahub.io/core/openml-datasets/_r/-/data/sick/sick.csv
https://datahub.io/core/openml-datasets/_r/-/data/sonar/README.md
https://datahub.io/core/openml-datasets/_r/-/data/sonar/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/sonar/sonar.arff
https://datahub.io/core/openml-datasets/_r/-/data/sonar/sonar.csv
https://datahub.io/core/openml-datasets/_r/-/data/soybean/README.md
https://datahub.io/core/openml-datasets/_r/-/data/soybean/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/soybean/soybean.arff
https://datahub.io/core/openml-datasets/_r/-/data/soybean/soybean.csv
https://datahub.io/core/openml-datasets/_r/-/data/spambase/README.md
https://datahub.io/core/openml-datasets/_r/-/data/spambase/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/spambase/spambase.arff
https://datahub.io/core/openml-datasets/_r/-/data/spambase/spambase.csv
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/README.md
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/spectrometer.arff
https://datahub.io/core/openml-datasets/_r/-/data/spectrometer/spectrometer.csv
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/README.md
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/steel-plates-fault.arff
https://datahub.io/core/openml-datasets/_r/-/data/steel-plates-fault/steel-plates-fault.csv
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/README.md
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/tic-tac-toe.arff
https://datahub.io/core/openml-datasets/_r/-/data/tic-tac-toe/tic-tac-toe.csv
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/README.md
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/vehicle.arff
https://datahub.io/core/openml-datasets/_r/-/data/vehicle/vehicle.csv
https://datahub.io/core/openml-datasets/_r/-/data/vote/README.md
https://datahub.io/core/openml-datasets/_r/-/data/vote/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/vote/vote.arff
https://datahub.io/core/openml-datasets/_r/-/data/vote/vote.csv
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/README.md
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/wall-robot-navigation.arff
https://datahub.io/core/openml-datasets/_r/-/data/wall-robot-navigation/wall-robot-navigation.csv
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/README.md
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/waveform-5000.arff
https://datahub.io/core/openml-datasets/_r/-/data/waveform-5000/waveform-5000.csv
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/README.md
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/wdbc.arff
https://datahub.io/core/openml-datasets/_r/-/data/wdbc/wdbc.csv
https://datahub.io/core/openml-datasets/_r/-/data/yeast/README.md
https://datahub.io/core/openml-datasets/_r/-/data/yeast/datapackage.json
https://datahub.io/core/openml-datasets/_r/-/data/yeast/yeast.arff
https://datahub.io/core/openml-datasets/_r/-/data/yeast/yeast.csv
https://datahub.io/core/openml-datasets/_r/-/datapackage.json
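
The per-dataset URLs listed above follow a regular pattern, data/NAME/NAME.EXT under a common prefix. A minimal Python sketch of that pattern follows; the helper names `r_link` and `descriptor_link` are hypothetical, and the pattern is inferred from this listing only, so it may not hold for files added later.

```python
from urllib.parse import quote

# Common prefix shared by every URL in the listing above.
BASE = "https://datahub.io/core/openml-datasets/_r/-"


def r_link(dataset: str, extension: str = "csv") -> str:
    """Build the URL of data/<dataset>/<dataset>.<extension> (hypothetical helper)."""
    name = quote(dataset, safe="")  # leaves letters, digits, '.', '-', '_' unchanged
    return f"{BASE}/data/{name}/{name}.{extension}"


def descriptor_link(dataset: str) -> str:
    """Build the URL of the per-dataset datapackage.json (hypothetical helper)."""
    return f"{BASE}/data/{quote(dataset, safe='')}/datapackage.json"
```

For example, `r_link("iris")` gives the iris.csv URL from the listing, and `r_link("IMDB.drama", "arff")` gives the IMDB.drama.arff URL. The resulting string can be passed to any HTTP client or CSV reader that accepts URLs.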
Key Files

Start with these files — they give you everything you need to understand and access the dataset.

https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/datapackage.json
README.md (documentation)
https://datahub.io/core/openml-datasets/_r/-/README.md
Typical Usage
  1. Fetch data/Bioresponse/datapackage.json to inspect schema and resources
  2. Download data resources listed in data/Bioresponse/datapackage.json
  3. Read README.md for full context
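
The first two steps above can be sketched in Python using only the standard library. This is a sketch under assumptions: it supposes the descriptor follows the usual Data Package layout, with a top-level `resources` list whose entries carry a string `path` relative to the descriptor. The function names are hypothetical.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen

DESCRIPTOR_URL = (
    "https://datahub.io/core/openml-datasets/_r/-/data/Bioresponse/datapackage.json"
)


def fetch_descriptor(url: str = DESCRIPTOR_URL) -> dict:
    """Step 1: download and parse the datapackage.json descriptor."""
    with urlopen(url, timeout=30) as response:
        return json.load(response)


def resource_urls(descriptor: dict, descriptor_url: str = DESCRIPTOR_URL) -> list:
    """Step 2: resolve each resource path against the descriptor URL.

    Assumes each resource has a string "path"; entries without one are skipped.
    """
    urls = []
    for resource in descriptor.get("resources", []):
        path = resource.get("path")
        if isinstance(path, str):
            urls.append(urljoin(descriptor_url, path))
    return urls
```

Calling `resource_urls(fetch_descriptor())` would then return the download URLs to fetch, provided the assumed layout matches the actual descriptor.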

Data Previews

spambase

Schema

name                        type    format
word_freq_make              number  default
word_freq_address           number  default
word_freq_all               any     default
word_freq_3d                number  default
word_freq_our               number  default
word_freq_over              number  default
word_freq_remove            any     default
word_freq_internet          number  default
word_freq_order             number  default
word_freq_mail              number  default
word_freq_receive           any     default
word_freq_will              number  default
word_freq_people            number  default
word_freq_report            number  default
word_freq_addresses         number  default
word_freq_free              any     default
word_freq_business          number  default
word_freq_email             any     default
word_freq_you               number  default
word_freq_credit            number  default
word_freq_your              number  default
word_freq_font              number  default
word_freq_000               number  default
word_freq_money             number  default
word_freq_hp                number  default
word_freq_hpl               number  default
word_freq_george            number  default
word_freq_650               number  default
word_freq_lab               number  default
word_freq_labs              number  default
word_freq_telnet            number  default
word_freq_857               number  default
word_freq_data              number  default
word_freq_415               number  default
word_freq_85                number  default
word_freq_technology        number  default
word_freq_1999              number  default
word_freq_parts             number  default
word_freq_pm                number  default
word_freq_direct            number  default
word_freq_cs                number  default
word_freq_meeting           number  default
word_freq_original          number  default
word_freq_project           number  default
word_freq_re                number  default
word_freq_edu               number  default
word_freq_table             number  default
word_freq_conference        number  default
char_freq_%3B               number  default
char_freq_%28               number  default
char_freq_%5B               number  default
char_freq_%21               number  default
char_freq_%24               any     default
char_freq_%23               number  default
capital_run_length_average  number  default
capital_run_length_longest  number  default
capital_run_length_total    number  default
class                       number  default

Data Files

File      Description  Size    Last modified  Download
spambase  -            707 kB  3 months ago   spambase
spambase  -            699 kB  3 months ago   spambase

Files: 2
Size: 1.41 MB
Format: arff, csv
Created / Updated: about 2 months ago
License: Open Data Commons Public Domain Dedication and License

The resources for this dataset can be found at https://www.openml.org/d/44

Author: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt
Source: UCI
Please cite: UCI

SPAM E-mail Database
The "spam" concept is diverse: advertisements for products/websites, make money fast schemes, chain letters, pornography… Our collection of spam e-mails came from our postmaster and individuals who had filed spam. Our collection of non-spam e-mails came from filed work and personal e-mails, and hence the word 'george' and the area code '650' are indicators of non-spam. These are useful when constructing a personalized spam filter. One would either have to blind such non-spam indicators or get a very wide collection of non-spam to generate a general purpose spam filter.

For background on spam:
Cranor, Lorrie F., LaMacchia, Brian A. Spam! Communications of the ACM, 41(8):74-83, 1998.

Attribute Information:

The last column denotes whether the e-mail was considered spam (1) or not (0), i.e. unsolicited commercial e-mail. Most of the attributes indicate whether a particular word or character was frequently occurring in the e-mail. The run-length attributes (55-57) measure the length of sequences of consecutive capital letters.

For the statistical measures of each attribute, see the end of this file. Here are the definitions of the attributes:

48 continuous real [0,100] attributes of type word_freq_WORD = percentage of words in the e-mail that match WORD, i.e. 100 * (number of times the WORD appears in the e-mail) / total number of words in e-mail. A "word" in this case is any string of alphanumeric characters bounded by non-alphanumeric characters or end-of-string.
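
The word_freq_WORD definition above can be illustrated with a short Python sketch. The helper name `word_freq` is hypothetical, and two details are assumptions the source does not settle: tokens are limited to ASCII letters and digits, and matching ignores case.

```python
import re


def word_freq(text: str, word: str) -> float:
    """Percentage of words in `text` equal to `word` (hypothetical helper).

    A word is a maximal run of ASCII alphanumeric characters.
    Case-insensitive matching is an assumption, not stated in the source.
    """
    words = re.findall(r"[A-Za-z0-9]+", text)
    if not words:
        return 0.0  # avoid dividing by zero on text with no words
    matches = sum(1 for w in words if w.lower() == word.lower())
    return 100.0 * matches / len(words)
```

For the text "free money, free offer" there are four words, two of which are "free", so `word_freq` returns 50.0.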

6 continuous real [0,100] attributes of type char_freq_CHAR = percentage of characters in the e-mail that match CHAR, i.e. 100 * (number of CHAR occurrences) / total characters in e-mail
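
A corresponding Python sketch for char_freq_CHAR, assuming that "total characters" counts every character in the text, whitespace included; the source does not say whether it does. The helper name `char_freq` is hypothetical.

```python
def char_freq(text: str, char: str) -> float:
    """Percentage of characters in `text` equal to `char` (hypothetical helper).

    Assumes every character, including whitespace, counts toward the total.
    """
    if not text:
        return 0.0  # avoid dividing by zero on empty text
    return 100.0 * text.count(char) / len(text)
```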

1 continuous real [1,…] attribute of type capital_run_length_average = average length of uninterrupted sequences of capital letters

1 continuous integer [1,…] attribute of type capital_run_length_longest = length of longest uninterrupted sequence of capital letters

1 continuous integer [1,…] attribute of type capital_run_length_total = sum of length of uninterrupted sequences of capital letters = total number of capital letters in the e-mail
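
The three capital_run_length attributes can be computed together. The Python sketch below assumes that "capital letters" means ASCII A-Z, and it returns zeros for text with no capitals, a case the definitions above do not cover. The helper name `capital_run_lengths` is hypothetical.

```python
import re


def capital_run_lengths(text: str) -> tuple:
    """Return (average, longest, total) over runs of consecutive capitals.

    Assumes ASCII capitals only; zeros for text without capitals is an
    assumption, since the source gives the range as starting at 1.
    """
    runs = [len(run) for run in re.findall(r"[A-Z]+", text)]
    if not runs:
        return (0.0, 0, 0)
    total = sum(runs)  # equals the number of capital letters in the text
    return (total / len(runs), max(runs), total)
```

For "HELLO World ABC" the runs have lengths 5, 1 and 3, giving an average of 3.0, a longest run of 5 and a total of 9.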

1 nominal {0,1} class attribute of type spam = denotes whether the e-mail was considered spam (1) or not (0), i.e. unsolicited commercial e-mail.