Skip to content
Longterm Wiki
Index
Fact·f_eH6tN2pQ5v·Fact

Evan Hubinger — Notable For: Co-authored Risks from Learned Optimization (2019) introducing mesa-optimization and deceptive alignment; led Sleeper Agents and Alignment Faking research at Anthropic; 3,400+ citations

Verdictpartial85%
1 check · 5/18/2026

1 → partial

Our claim

entire record
Subject
Evan Hubinger
Property
Notable For
Value
Co-authored Risks from Learned Optimization (2019) introducing mesa-optimization and deceptive alignment; led Sleeper Agents and Alignment Faking research at Anthropic; 3,400+ citations
As Of
March 2026
Notes
Enriched from wiki page

Source evidence

1 src · 1 check
partial85%primaryHaiku 4.5 · 5/18/2026

NoteThe source directly confirms three of the four components of the claim: (1) Hubinger's co-authorship of the 2019 paper introducing mesa-optimization is verified; (2) deceptive alignment is discussed in Section 4 as stated in the claim. However, the source does not address: (1) Hubinger's role leading 'Sleeper Agents and Alignment Faking research at Anthropic' — the source is from 2019 and does not discuss his later work at Anthropic; (2) the citation count of '3,400+ citations (as of 2026-03)' — the source text contains no citation metrics. The claim's temporal reference 'as of 2026-03' is also beyond the source document's publication date (2019), making the citation count unverifiable from this source. The paper excerpt confirms the core research contributions but cannot verify employment history or citation metrics.

Case № f_eH6tN2pQ5vFiled 5/18/2026Confidence 85%
Source Check: Notable For | Longterm Wiki