The Price of Progress: Algorithmic Efficiency and the Falling Cost of AI Inference
Credibility Rating
Good quality. Reputable source with community review or editorial standards, but less rigorous than peer-reviewed venues.
Rating inherited from publication venue: arXiv
Relevant to AI safety and governance discussions about capability trajectories and accessibility; rapid cost declines affect when AI systems become broadly deployable and who can access frontier capabilities.
Paper Details
Metadata
Summary
This paper quantifies the rate at which AI inference costs are falling for a given level of benchmark performance, finding 5-10× annual price reductions for frontier models across knowledge, reasoning, math, and coding benchmarks. After controlling for hardware improvements and market competition, the authors attribute approximately 3× annual cost reduction to algorithmic efficiency gains alone. They recommend that benchmark evaluations incorporate pricing data to better capture real-world practical impact.
Key Points
- Inference cost for a given benchmark performance level has dropped 5-10× per year for frontier models across knowledge, reasoning, math, and software engineering tasks.
- Algorithmic efficiency improvements alone account for ~3× annual cost reduction after controlling for hardware price declines and competition effects.
- The study uses the largest known historical dataset of AI benchmarking prices, combining Epoch AI and Artificial Analysis data via Internet Archive snapshots.
- Current benchmarks present a distorted picture of AI progress by ignoring cost, potentially overstating practical capability gains from expensive models.
- Authors recommend that evaluators publicize and incorporate model pricing as a core metric alongside performance scores.
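The decomposition behind the first two bullets (overall decline rate divided by the hardware rate leaves the algorithmic residual) can be illustrated with a back-of-the-envelope calculation. This is a sketch with made-up numbers: the $90 → $1 price path and the 3×/yr hardware rate are assumptions for illustration, not the paper's data.

```python
# Illustrative decomposition of annual inference-cost decline into
# hardware and algorithmic components. All concrete numbers below are
# hypothetical placeholders, not figures from the paper's dataset.

def annual_decline_factor(price_start: float, price_end: float, years: float) -> float:
    """Geometric annualized factor by which price falls (e.g. 5.0 = 5x/yr)."""
    return (price_start / price_end) ** (1 / years)

# Hypothetical: cost to reach a fixed benchmark score fell from $90 to
# $1 per run over 2 years among open models (restricting to open models
# controls for competition/margin effects, per the paper's approach).
total = annual_decline_factor(90.0, 1.0, 2)   # ~9.5x per year

# Hypothetical hardware price-performance improvement of ~3x per year.
hardware = 3.0

# Residual attributed to algorithmic efficiency.
algorithmic = total / hardware

print(f"total decline:      {total:.2f}x/yr")
print(f"hardware decline:   {hardware:.2f}x/yr")
print(f"algorithmic factor: {algorithmic:.2f}x/yr")
```

The point of the sketch is only the arithmetic shape of the argument: multiplicative rates compose, so dividing out the hardware factor isolates what is left for algorithms and other sources.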
Cited by 2 pages
| Page | Type | Quality |
|---|---|---|
| AI Timelines | Concept | 95.0 |
| Compute Thresholds | Concept | 91.0 |
Cached Content Preview
The Price of Progress: Algorithmic Efficiency and the Falling Cost of AI Inference. Hans Gundlach (MIT CSAIL, MIT FutureTech, hansgund@mit.edu), Jayson Lynch (MIT CSAIL, MIT FutureTech, jaysonl@mit.edu), Matthias Mertens (MIT Sloan, MIT FutureTech, mmertens@mit.edu), Neil Thompson (MIT CSAIL, MIT FutureTech, neil_t@mit.edu). Corresponding authors: hansgund@mit.edu, neil_t@mit.edu. Abstract: Language models have seen enormous progress on advanced benchmarks in recent years, but much of this progress has only been possible by using more costly models. Benchmarks may therefore present a warped picture of progress in practical capabilities per dollar. To remedy this, we use data from Artificial Analysis and Epoch AI to form the largest dataset of current and historical prices to run benchmarks to date. We find that the price for a given level of benchmark performance has decreased remarkably fast, around 5× to 10× per year, for frontier models on knowledge, reasoning, math, and software engineering benchmarks. These reductions in the cost of AI inference are due to economic forces, hardware efficiency improvements, and algorithmic efficiency improvements. Isolating out open models to control for competition effects and dividing by hardware price declines, we estimate that algorithmic efficiency progress is around 3× per year. Finally, we recommend that evaluators both publicize and take into account the price of benchmarking as an essential part of measuring the real-world impact of AI. [Footnote: This paper was accepted to the NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle: Benchmarks, Emergent Abilities, and Scaling (https://sites.google.com/view/llm-eval-workshop).]
1 Introduction. A critical dimension of the real-world impact of language models (and AI systems in general) is cost, which is often ignored in discourses around evaluations and AI performance. Popular blog posts have ignited discussions on potentially large declines in prices for accessing a given level of (high) LLM performance (appenzeller2024llmflation). At the same time, epoch2025llminferencepricetrends finds that, controlling for benchmark performance, LLM token prices may be decreasing by factors of 10–1,000× per year, depending on the performance level. On the other hand, erol2025cost finds that the cost-of-pass on benchmarks like MATH 500 and AIME 2024 has gone down by 24.5× and 3.23× per year, respectively. Understanding these price trends is key for many issues, such as predicting the cost efficiency of models versus labor-based work, or democratizing access to state-of-the-art AI capabilities. [Footnote: Documenting changes in quality-adjusted AI prices is also relevant to economic science, as these data can be helpful for understanding substitution elasticities between labor and AI inputs.] In this study, we carefully examine how
... (truncated, 80 KB total)
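The cost-of-pass metric cited in the introduction (erol2025cost) can be made concrete with a short sketch. The formulation below uses the standard expected-cost-per-solved-problem reading: with independent attempts at per-attempt cost c and success probability p, the expected number of attempts is 1/p, so the expected cost is c/p. The dollar figures and accuracies are hypothetical, not the paper's data.

```python
# Sketch of the cost-of-pass idea: expected inference spend required to
# obtain one correct answer. Hypothetical numbers throughout.

def cost_of_pass(cost_per_attempt: float, pass_rate: float) -> float:
    """Expected cost per correct solution under independent retries."""
    if not 0 < pass_rate <= 1:
        raise ValueError("pass_rate must be in (0, 1]")
    return cost_per_attempt / pass_rate

# Hypothetical: an older model at $0.50/attempt with 20% accuracy vs.
# a newer model at $0.30/attempt with 60% accuracy on the same benchmark.
old = cost_of_pass(0.50, 0.20)   # $2.50 per solved problem
new = cost_of_pass(0.30, 0.60)   # $0.50 per solved problem
print(f"cost-of-pass fell {old / new:.1f}x")
```

Note how the metric couples price and capability: the newer model wins both by charging less per attempt and by needing fewer attempts, which is exactly the "capability per dollar" framing the paper argues benchmarks should report.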