Aider Polyglot

Coding

A multi-language coding benchmark that tests AI models on real-world code editing tasks across Python, JavaScript, TypeScript, C#, Java, Ruby, Go, C++, PHP, and Rust.

Models Tested

Best Score

9.8

Median Score

9.8

Scoring: percentage

Introduced: 2024-09

Maintainer: Aider

Leaderboard (1 model)

#	Model	Developer	Score
🥇	GPT-4.1 nano	OpenAI	9.8