Skip to content
Longterm Wiki
Search
Entities
Research
Policy
Sources
FactBase
About
Internal
Search
⌘K
Benchmarks
/
Aider Polyglot
Aider Polyglot
Coding
Wiki page
Website
Data
A multi-language coding benchmark that tests AI models on real-world code editing tasks across Python, JavaScript, TypeScript, C#, Java, Ruby, Go, C++, PHP, and Rust.
Models Tested
1
Best Score
9.8
Median Score
9.8
Scoring:
percentage
Introduced:
2024-09
Maintainer:
Aider
Leaderboard
(1 model)
#
Model
Developer
Score
🥇
GPT-4.1 nano
OpenAI
9.8