Skip to content
Longterm Wiki

Claude Opus 4.6 on HumanEval: 95.4

benchmark-resultVerified

Child of HumanEval

Metadata

Source Tablebenchmark_results
Source ID3TppXW451A
Source URLautomatio.ai/models/claude-opus-4-6
ParentHumanEval
Children
CreatedApr 24, 2026, 7:51 PM
UpdatedApr 24, 2026, 7:51 PM
SyncedApr 24, 2026, 7:51 PM

Record Data

id3TppXW451A
benchmarkIdvxX2rorgxU
modelIdClaude Opus 4.6(ai-model)
score95.4
unitpercent
date2026-02-05
sourceUrlautomatio.ai/models/claude-opus-4-6
notes
testedByunknown
testedByOrgId
evaluationDate
methodologyNotes

Source Check Verdicts

confirmed99% confidence

Last checked: 4/29/2026

1 → confirmed

Debug info

Thing ID: 3TppXW451A

Source Table: benchmark_results

Source ID: 3TppXW451A

Parent Thing ID: vxX2rorgxU