Skip to content
Longterm Wiki
Index
Fact·f_5MsUCUfbSw·Fact

OpenAI — Benchmark Score: 71.7

Verdictconfirmed95%
1 check · 5/18/2026

1 → confirmed

Our claim

entire record
Subject
OpenAI
Property
Benchmark Score
Value
71.7
As Of
September 2024
Notes
o1 model on SWE-bench Verified.

Source evidence

1 src · 1 check
confirmed95%primaryHaiku 4.5 · 5/18/2026

NoteThe source directly confirms the claimed benchmark score of 71.7% for software programming. The source explicitly attributes this score to OpenAI's o3 model and notes it outperforms the o1 model. The article was published January 24, 2025, which is after the claimed date of 2024-09, but the source is discussing o3's announced performance metrics from December 2024. The claim's additional context about 'o1 model on SWE-bench' is slightly different from the source's framing (which attributes 71.7% to o3, not o1), but the 71.7% figure itself is confirmed as accurate for software programming performance.

Case № f_5MsUCUfbSwFiled 5/18/2026Confidence 95%