MathVista
MultimodalA benchmark for mathematical reasoning in visual contexts — combines visual understanding with mathematical problem-solving across geometry, charts, scientific figures, and more.
Models Tested
3
Best Score
73.9%
Median Score
67.7%
Scoring: accuracy
Introduced: 2023-10
Maintainer: UCLA / Microsoft Research
Leaderboard3 models
| # | Model | Developer | Score |
|---|---|---|---|
| 🥇 | o1 | OpenAI | 73.9% |
| 🥈 | Claude 3.5 Sonnet | Anthropic | 67.7% |
| 🥉 | GPT-4o | OpenAI | 63.8% |