Skip to content
Longterm Wiki

Nelson Elhage — Research Scientist at Anthropic

personnelFailed source check

Metadata

Source Tablepersonnel
Source IDMwZ6VCPEIg
Source URLnelhage.com/
ParentAnthropic
Children
CreatedMar 24, 2026, 8:45 PM
UpdatedApr 10, 2026, 8:49 PM
SyncedApr 10, 2026, 8:49 PM

Record Data

idMwZ6VCPEIg
personIdNelson Elhage(person)
organizationIdAnthropic(organization)
personEntityIdNelson Elhage(person)
personDisplayName
orgEntityIdAnthropic(organization)
orgDisplayName
roleResearch Scientist
roleTypecareer
startDate2021
endDate
isFounderNo
appointedBy
background
sourcenelhage.com/
notesKey contributor to mechanistic interpretability research; co-authored 'Toy Models of Superposition' and other foundational interpretability papers Per nelhage.com, currently on Anthropic pretraining team; previously worked on reverse-engineering large language models (interpretability/circuits).

Source Check Verdicts

unverifiable85% confidence

Last checked: 5/11/2026

1 → unverifiable; dissent: 1 → partial

Debug info

Thing ID: MwZ6VCPEIg

Source Table: personnel

Source ID: MwZ6VCPEIg