Evan Hubinger — Notable For: Co-authored Risks from Learned Optimization (2019) introducing mesa-optimization and deceptive alignment; led Sleeper Agents and Alignment Faking research at Anthropic; 3,400+ citations
1 → partial
Our claim
- Subject: Evan Hubinger
- Property: Notable For
- Value: Co-authored Risks from Learned Optimization (2019) introducing mesa-optimization and deceptive alignment; led Sleeper Agents and Alignment Faking research at Anthropic; 3,400+ citations
- As Of: March 2026
- Notes: Enriched from wiki page
Source evidence
1 src · 1 check
Note
The source directly confirms two of the four components of the claim: (1) Hubinger's co-authorship of the 2019 paper introducing mesa-optimization, and (2) the discussion of deceptive alignment, which appears in Section 4 of the paper. The source does not address the other two components: (3) Hubinger's role leading 'Sleeper Agents and Alignment Faking research at Anthropic', because the source dates from 2019 and does not cover his later work at Anthropic; and (4) the citation count of '3,400+ citations (as of 2026-03)', because the source text contains no citation metrics and the 'as of 2026-03' date falls well after the source's 2019 publication. The paper excerpt therefore confirms the core research contributions but cannot verify employment history or citation metrics.