Skip to content
Longterm Wiki

FrontierMath

Math
Research-grade mathematics benchmark from Epoch AI featuring original problems created by professional mathematicians. Designed to remain unsaturated for years.
Models Tested
1
Best Score
4.5
Median Score
4.5
Scoring: accuracy
Introduced: 2024-11
Maintainer: Epoch AI

Leaderboard (1 model)

#ModelDeveloperScore
🥇GPT-4.1 miniOpenAI
4.5