Claude Opus 4.5 on HumanEval: 92

benchmark-result

Metadata

Source Table	`benchmark_results`
Source ID	`I4H3zQ7VJK`
Source URL	automatio.ai/models/claude-opus-4-5
Parent	HumanEval
Children	—
Created	Apr 24, 2026, 7:48 PM
Updated	Apr 24, 2026, 7:48 PM
Synced	Apr 24, 2026, 7:48 PM

`id`	I4H3zQ7VJK
`benchmarkId`	vxX2rorgxU
`modelId`	Claude Opus 4.5(ai-model)
`score`	92
`unit`	percent
`date`	2025-11-24
`sourceUrl`	automatio.ai/models/claude-opus-4-5
`notes`	HumanEval - Python function implementation benchmark
`testedBy`	unknown
`testedByOrgId`	—
`evaluationDate`	—
`methodologyNotes`	—

confirmed99% confidence

Last checked: 4/29/2026

1 → confirmed

Debug info

Thing ID: I4H3zQ7VJK

Source Table: benchmark_results

Source ID: I4H3zQ7VJK

Parent Thing ID: vxX2rorgxU