Scorecard: ailabwatch (2025-09-01) scored Google DeepMind "Very Weak" on Scheming risk prevention

Verdict: confirmed (95%)

1 check · 6/29/2026

1 → confirmed

Our claim

entire record

Snapshot: ailabwatch-2025-09
Entity: Google DeepMind
Dimension Slug: scheming
Dimension Label: Scheming risk prevention
Score Numeric: 8
Score Letter: Very Weak
Score Raw: 8%

Source evidence

1 src · 1 check

ailabwatch.org/resource

confirmed · 95% · Haiku 4.5 · 6/29/2026

Note: The source directly confirms all key fields: (1) publisher is 'AI Lab Watch', (2) the date is September 2025 (the claim specifies 2025-09-01, source says 'as of September 15'; both fall within the same month, so temporal alignment is acceptable), (3) entity is 'DeepMind' (shown in the column header), (4) dimension is 'Scheming risk prevention' (shown in the row header), (5) scoreRaw is 8% (shown in the DeepMind column for that row). The scoreLetter 'Very Weak' is a reasonable qualitative interpretation of 8%, consistent with typical grading scales. The scoreNumeric value of 8 matches the raw percentage score of 8%.
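
The numeric consistency check described in the note can be sketched in Python. This is a minimal illustration, not the verifier's actual implementation: the `ScorecardRecord` class, its field names, and the `raw_matches_numeric` helper are hypothetical, and the values are copied from the record above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScorecardRecord:
    """Hypothetical container mirroring the fields listed under 'Our claim'."""
    snapshot: str
    entity: str
    dimension_slug: str
    dimension_label: str
    score_numeric: float
    score_letter: str
    score_raw: str


def raw_matches_numeric(record: ScorecardRecord) -> bool:
    """Return True when the raw percentage string equals the numeric score."""
    raw = record.score_raw.strip()
    if not raw.endswith("%"):
        return False
    try:
        # Drop the trailing '%' and compare the remaining number.
        return float(raw[:-1]) == record.score_numeric
    except ValueError:
        return False


# Values taken from the record; the structure itself is an assumption.
record = ScorecardRecord(
    snapshot="ailabwatch-2025-09",
    entity="Google DeepMind",
    dimension_slug="scheming",
    dimension_label="Scheming risk prevention",
    score_numeric=8,
    score_letter="Very Weak",
    score_raw="8%",
)

print(raw_matches_numeric(record))
```

The letter grade is deliberately left unchecked here, because the record does not state the thresholds that map 8% to 'Very Weak'.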

Case № ailabwatch-2025-09|sid_A4XoubikkQ|scheming · Filed 6/29/2026 · Confidence 95%