Epoch AI: Database of AI Models (Training Compute, Parameters, and Trends)

web

Credibility Rating

4/5

High(4)

High quality. Established institution or organization with editorial oversight and accountability.

Rating inherited from publication venue: Epoch AI

EpochDB is a foundational empirical dataset frequently cited in AI safety and governance discussions to contextualize claims about AI progress rates, compute requirements, and capability trajectories.

Metadata

Importance: 72/100dataset

Summary

EpochDB is a comprehensive database maintained by Epoch AI that tracks historical and current AI models, including data on training compute, dataset sizes, parameters, and publication dates. It serves as a key empirical resource for researchers studying AI progress, scaling trends, and forecasting future capabilities. The database enables quantitative analysis of how AI development has evolved over time.

Key Points

•Tracks hundreds of notable AI models with structured data on compute (FLOPs), parameters, training dataset size, and release dates.
•Enables trend analysis and forecasting of AI capability growth, often cited in compute scaling research and policy discussions.
•Maintained by Epoch AI, a research organization focused on empirical analysis of AI progress and trajectories.
•Widely used by researchers and policymakers to ground claims about AI advancement rates in empirical data.
•Supports analysis of compute efficiency improvements and the relationship between resources and model performance over time.

Cited by 1 page

Page	Type	Quality
Epoch AI	Organization	51.0

Cached Content Preview

HTTP 200Fetched Apr 7, 20268 KB

Data on AI Models | Epoch AI 

 
 
 

 

 

 
 Updated Mar. 31, 2026 AI Models

 Our comprehensive database of over 3200 models tracks key factors driving machine learning progress.

 Download this data Notable AI models Frontier AI models Large-scale AI models All AI models 
 
 
 Graph 
 
 
 Table Settings 
 
 
 
 
 
 
 Graph 
 
 
 

 Y Axis Training compute (FLOP) Training compute (FLOP) Parameters Training dataset size (total) Training compute cost (2023 USD) Training time (days) Training power draw (W) 
 
 
 Training compute (FLOP) 
 
 
 
 Parameters 
 
 
 
 Training dataset size (total) 
 
 
 
 Training compute cost (2023 USD) 
 
 
 
 Training time (days) 
 
 
 
 Training power draw (W) 
 
 
 
 X Axis Publication date Publication date Training compute (FLOP) Parameters Training dataset size (total) Training compute cost (2023 USD) Training time (days) Training power draw (W) 
 
 
 Publication date 
 
 
 
 Training compute (FLOP) 
 
 
 
 Parameters 
 
 
 
 Training dataset size (total) 
 
 
 
 Training compute cost (2023 USD) 
 
 
 
 Training time (days) 
 
 
 
 Training power draw (W) 
 
 
 
 Color by None Domain Organization Country Accessibility Display 
 
 
 

 Show Deep Learning Era 
 
 
 Show regressions 
 
 
 Show regresions as Nx/year OOM/year Doubling time Confidence interval width 
 
 
 % Filter 
 
 
 

 Filter by text 
 
 
 Show only top N models, where N is 
 
 
 Apply More about this dataset

 Documentation Downloads Citations FAQ Documentation

 Models in this dataset have been collected from various sources, including literature reviews, Papers With Code, historical accounts, highly-cited publications, proceedings of top conferences, and suggestions from individuals. The list of models is non-exhaustive, but aims to cover most models that were state-of-the-art when released, have over 1000 citations, one million monthly active users, or an equivalent level of historical significance. Additional information about our approach to measuring parameter counts, dataset size, and training compute can be found in the accompanying documentation.

 Read the complete documentation Frequently asked questions

 What is a notable model? 
 
 
 

 A notable model meets any of the following criteria: (i) state-of-the-art improvement on a recognized benchmark; (ii) highly cited (over 1000 citations); (iii) historical relevance; (iv) significant use.

 How was the AI Models dataset created? 
 
 
 

 The dataset was originally created for the report “Compute Trends Across Three Eras of Machine Learning” and has continually grown and expanded since then.

 What are notable, frontier, and large-scale models? 
 
 
 

 We flag models as notable if they advanced the state of the art, achieved many citations in an academic publication, had over a million monthly users, were highly significant historically, or were developed at a cost of over one million dollars. You can learn more about these notability criteria by reading our AI Models Documentation .


... (truncated, 8 KB total)

Resource ID: 07b3dfad309f0eb3 | Stable ID: sid_2Nn2c5XnHI