Skip to content
Longterm Wiki

LiveCodeBench

Coding

A contamination-free coding benchmark using new competitive programming problems from LeetCode, AtCoder, and Codeforces. Problems are refreshed continuously to prevent data leakage.

Models Tested
9
Best Score
79.4%
Median Score
63.4%
Scoring: pass_at_1
Introduced: 2024-06
Maintainer: LiveCodeBench Team

Leaderboard9 models

#ModelDeveloperScore
🥇Grok-3xAI
79.4%
🥈o3OpenAI
71.7%
🥉o4-miniOpenAI
67.8%
4DeepSeek R1DeepSeek
65.9%
5Gemini 2.5 ProGoogle DeepMind
63.4%
6o3-miniOpenAI
57.6%
7Llama 4 MaverickMeta AI (FAIR)
43.4%
8DeepSeek V3DeepSeek
40.5%
9Llama 4 ScoutMeta AI (FAIR)
32.8%