Claude Opus 4.5 on SimpleQA: 36%

benchmark-result

Metadata

Source Table	`benchmark_results`
Source ID	`TFSWZuy9Y8`
Source URL	automatio.ai/models/claude-opus-4-5
Parent	SimpleQA
Children	—
Created	Apr 24, 2026, 7:48 PM
Updated	Apr 24, 2026, 7:48 PM
Synced	Apr 24, 2026, 7:48 PM

`id`	TFSWZuy9Y8
`benchmarkId`	1O19f6j13Z
`modelId`	Claude Opus 4.5 (ai-model)
`score`	36
`unit`	percent
`date`	2025-11-24
`sourceUrl`	automatio.ai/models/claude-opus-4-5
`notes`	SimpleQA - factual accuracy benchmark for straightforward questions
`testedBy`	unknown
`testedByOrgId`	—
`evaluationDate`	—
`methodologyNotes`	—

confirmed (99% confidence)

Last checked: 4/29/2026

1 → confirmed
