Longterm Wiki

Aider Polyglot

Coding
A multi-language coding benchmark that tests AI models on code editing tasks, drawn from Exercism exercises, across C++, Go, Java, JavaScript, Python, and Rust.
Models Tested: 1
Best Score: 9.8%
Median Score: 9.8%
Scoring: percentage
Introduced: 2024-12
Maintainer: Aider

Leaderboard (1 model)

# | Model | Developer | Score (%)
🥇 | GPT-4.1 nano | OpenAI | 9.8