Skip to content
Longterm Wiki
Search
Entities
Research
Policy
Sources
FactBase
About
Internal
Search
⌘K
Benchmarks
/
ARC-AGI
ARC-AGI
Reasoning
Wiki page
Website
Data
Abstraction and Reasoning Corpus — a benchmark of visual pattern recognition tasks designed to test fluid intelligence and novel reasoning.
Models Tested
2
Best Score
87.5%
Median Score
85.05%
Scoring:
accuracy
Introduced:
2019-11
Maintainer:
Francois Chollet
Leaderboard
(2 models)
#
Model
Developer
Score
🥇
o3
OpenAI
87.5%
🥈
o4-mini
OpenAI
82.6%