Skip to content
Longterm Wiki

Claude Haiku 4.5 on SWE-bench Verified: 73.3

benchmark-resultVerified

Metadata

Source Tablebenchmark_results
Source ID9iqsCKcMgg
ParentSWE-bench Verified
Children
CreatedApr 24, 2026, 6:42 PM
UpdatedApr 24, 2026, 6:42 PM
SyncedApr 24, 2026, 6:42 PM

Record Data

id9iqsCKcMgg
benchmarkIdWOSlsBTTmV
modelIdClaude Haiku 4.5(ai-model)
score73.3
unitpercent
date2025-10-15
sourceUrl
notesScore reported by Anthropic averaged over 50 runs. One of the highest scores in its price tier. Tested via Anthropic's agent scaffold.
testedByunknown
testedByOrgId
evaluationDate
methodologyNotes

Source Check Verdicts

confirmed98% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: 9iqsCKcMgg

Source Table: benchmark_results

Source ID: 9iqsCKcMgg

Parent Thing ID: WOSlsBTTmV