Models Tested: 6
Best Score: 1444
Median Score: 1376
Scoring: Elo
Introduced: 2023-05
Maintainer: LMArena (formerly LMSYS)
Leaderboard (6 models)
| # | Model | Developer | Score |
|---|---|---|---|
| 🥇 | Gemini 2.5 Pro | Google DeepMind | 1444 |
| 🥈 | Grok | xAI | 1402 |
| 🥉 | Grok-3 | xAI | 1402 |
| 4 | o1 | OpenAI | 1350 |
| 5 | GPT-4o | OpenAI | 1285 |
| 6 | Claude 3.5 Sonnet | Anthropic | 1271 |
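
The summary figures (models tested, best score, median score) can be checked against the table. The sketch below is one way to do that, assuming the median is the ordinary statistical median of the six listed scores; the `scores` dictionary and variable names are illustrative and not part of the leaderboard's own tooling.

```python
from statistics import median

# Scores copied from the leaderboard table above.
scores = {
    "Gemini 2.5 Pro": 1444,
    "Grok": 1402,
    "Grok-3": 1402,
    "o1": 1350,
    "GPT-4o": 1285,
    "Claude 3.5 Sonnet": 1271,
}

models_tested = len(scores)
best_score = max(scores.values())
# With an even number of entries, the median is the mean of the two
# middle scores (here 1402 and 1350).
median_score = median(scores.values())

print(models_tested, best_score, median_score)
```

Under that assumption the computed values agree with the reported summary: 6 models, a best score of 1444, and a median of 1376.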