OpenAI — Benchmark Score: 71.7
Verdict: confirmed · 95%
1 check · 5/18/2026 · 1 → confirmed
Our claim
- Subject: OpenAI
- Property: Benchmark Score
- Value: 71.7
- As Of: September 2024
- Notes: o1 model on SWE-bench Verified.
Source evidence
1 src · 1 check
plantemoran.com/explore-our-thinking/insight/2025/01/unveiling-openai-o3-from-benchmarks-to-real-world · resource
confirmed · 95% · primary · Haiku 4.5 · 5/18/2026
Note: The source directly confirms the claimed benchmark score of 71.7% for software programming. It explicitly attributes this score to OpenAI's o3 model and notes that o3 outperforms the o1 model. The article was published January 24, 2025, which is after the claimed date of 2024-09, but it discusses o3's performance metrics announced in December 2024. The claim's added context ('o1 model on SWE-bench') differs from the source's framing, which attributes 71.7% to o3, not o1; the 71.7% figure itself is confirmed as accurate for software programming performance.
Case № f_5MsUCUfbSw · Filed 5/18/2026 · Confidence 95%