Claude Opus 4.6 on HumanEval: 95.4

benchmark-result

Metadata

Source Table	`benchmark_results`
Source ID	`3TppXW451A`
Source URL	automatio.ai/models/claude-opus-4-6
Parent	HumanEval
Children	—
Created	Apr 24, 2026, 7:51 PM
Updated	Apr 24, 2026, 7:51 PM
Synced	Apr 24, 2026, 7:51 PM

`id`	3TppXW451A
`benchmarkId`	vxX2rorgxU
`modelId`	Claude Opus 4.6(ai-model)
`score`	95.4
`unit`	percent
`date`	2026-02-05
`sourceUrl`	automatio.ai/models/claude-opus-4-6
`notes`	—
`testedBy`	unknown
`testedByOrgId`	—
`evaluationDate`	—
`methodologyNotes`	—

confirmed99% confidence

Last checked: 4/29/2026

1 → confirmed

Debug info

Thing ID: 3TppXW451A

Source Table: benchmark_results

Source ID: 3TppXW451A

Parent Thing ID: vxX2rorgxU