Skip to content
Longterm Wiki

MathVista

Multimodal

A benchmark for mathematical reasoning in visual contexts — combines visual understanding with mathematical problem-solving across geometry, charts, scientific figures, and more.

Models Tested
3
Best Score
73.9%
Median Score
67.7%
Scoring: accuracy
Introduced: 2023-10
Maintainer: UCLA / Microsoft Research

Leaderboard3 models

#ModelDeveloperScore
🥇o1OpenAI
73.9%
🥈Claude 3.5 SonnetAnthropic
67.7%
🥉GPT-4oOpenAI
63.8%