Epoch Data on AI Models

1,766
0
Updated:
Files:4
Size:8.85 MB
Formats:csv
License:CC-BY-4.0

Comprehensive database of over 2800 AI/ML models tracking key factors driving machine learning progress, including parameters, training compute, training dataset size, publication date, organization, and more. Sourced from Epoch AI.

API Access

Access dataset files directly from scripts, code, or AI agents.

Browse dataset files
Dataset Files

Each file has a stable URL (r-link) that you can use directly in scripts, apps, or AI agents. These URLs are permanent and safe to hardcode.

/ai/epoch-data-on-ai-models/
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/AGENTS.md
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/README.md
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/data/all_ai_models.csv
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/data/frontier_ai_models.csv
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/data/large_scale_ai_models.csv
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/data/notable_ai_models.csv
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/datapackage.json
Key Files

Start with these files — they give you everything you need to understand and access the dataset.

datapackage.jsonmetadata & schema
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/datapackage.json
README.mddocumentation
https://datahub.io/ai/epoch-data-on-ai-models/_r/-/README.md
Typical Usage
  1. 1. Fetch datapackage.json to inspect schema and resources
  2. 2. Download data resources listed in datapackage.json
  3. 3. Read README.md for full context

Data Views

Data Files

Explore with AI

All AI Models

Loading data...

Download

Download CSV

About

All AI models in the Epoch database (~21,000 entries).
Last updated
19 March 2026
Total rows
...
Format
CSV
File size
5.72 MB

Notable AI Models

Loading data...

Download

Download CSV

About

Subset of notable AI models with richer metadata (~7,400 entries).
Last updated
19 March 2026
Total rows
...
Format
CSV
File size
1.85 MB

Large-Scale AI Models

Loading data...

Download

Download CSV

About

Large-scale AI models subset (~3,600 entries).
Last updated
19 March 2026
Total rows
...
Format
CSV
File size
902 kB

Frontier AI Models

Loading data...

Download

Download CSV

About

Frontier AI models subset — the most capable models at each point in time (~1,600 entries).
Last updated
19 March 2026
Total rows
...
Format
CSV
File size
371 kB

About this dataset

Epoch Data on AI Models

Comprehensive database of AI/ML models tracking key factors driving machine learning progress. Sourced from Epoch AI.

Data

The dataset is split into four CSV files covering different subsets of models:

FileDescriptionRows
data/all_ai_models.csvAll AI models in the Epoch database~21,600
data/notable_ai_models.csvNotable models with richer metadata~7,400
data/large_scale_ai_models.csvLarge-scale models subset~3,600
data/frontier_ai_models.csvFrontier models — most capable at each point in time~1,600

Key fields

  • Model — name of the model
  • Organization — developing organization (e.g. Google, OpenAI, DeepMind)
  • Publication date — date of release or publication (1950–2025)
  • Domain — area of application (Language, Vision, Multimodal, Robotics, etc.)
  • Task — specific task(s) the model performs
  • Parameters — number of model parameters
  • Training compute (FLOP) — total training compute in floating point operations
  • Training dataset — name/description of training data
  • Training dataset size (datapoints) — number of training examples
  • Training hardware — hardware used for training
  • Training compute cost (2023 USD) — estimated cost of training
  • Frontier model — whether the model was state-of-the-art at release
  • Model accessibility — open weights, closed, API-only, etc.

Full field-level documentation is in datapackage.json.

License

Creative Commons Attribution 4.0 (CC-BY-4.0) — Epoch AI.

Source

Epoch AI — Notable AI Models: https://epochai.org/data/notable-ai-models