Neel Nanda - Senior Research Scientist at Google DeepMind (Prog.AI Profile)

web

This is Neel Nanda's professional profile page, a leading mechanistic interpretability researcher at Google DeepMind and creator of TransformerLens, a key open-source tool widely used in AI safety interpretability research.

Metadata

Importance: 42/100homepage

Summary

This is a third-party professional profile of Neel Nanda, Senior Research Scientist at Google DeepMind specializing in mechanistic interpretability. He is the creator of TransformerLens, contributor to sparse autoencoders (Gemma Scope), and has published work on grokking. He has prior affiliations with Anthropic, FHI, and CHAI.

Key Points

•Neel Nanda is a leading mechanistic interpretability researcher at Google DeepMind, formerly at Anthropic, FHI, and CHAI.
•Creator of TransformerLens, an open-source library widely used for mechanistic interpretability of GPT-style language models.
•Contributed to sparse autoencoders research including Gemma Scope and Gated SAEs (NeurIPS 2024).
•Published ICLR Spotlight paper on progress measures for grokking, bridging practical and theoretical interpretability.
•Active in the Effective Altruism community and committed to donating 10% of income to high-impact charities.

Cached Content Preview

HTTP 200Fetched Apr 28, 20262 KB

Neel Nanda - Senior Research Scientist at Google DeepMind Neel Nanda

 Senior Research Scientist at Google DeepMind 

 London, England, United Kingdom Join Prog.AI to see contacts Join Prog.AI to see contacts Summary

 🤩 Rockstar 🎓 Top School Neel Nanda is a Senior Research Scientist at Google DeepMind in London who focuses on mechanistic interpretability and the safety of transformer-based language models. His recent work includes contributions to sparse autoencoders (Gemma Scope and Gated SAEs, NeurIPS 2024) and an ICLR Spotlight paper on progress measures for grokking, reflecting a blend of practical tooling and theoretical insight. He is the creator of TransformerLens, an open-source library used for mechanistic interpretability, and has a track record of mentoring and publishing work that helps onboard others to the field. Formerly at Anthropic and affiliated with FHI and CHAI, he brings seven years of research and engineering experience grounded in a BA in Mathematics from Cambridge. Outside of research he’s active in the Effective Altruism community and has committed to donating 10% of his income to high-impact charities. 8 years of coding experience 3 years of employment as a software developer BA, Mathematics, BA, Mathematics at University of Cambridge The Latymer School Github Skills ( 6 )

 transformers 10 transformer-models 10 pytorch 10 machine-learning 10 python 10 natural-language-processing 10 Programming languages ( 6 )

 TypeScript CSS TeX Swift Jupyter Notebook Python Github contributions (5)

 More TransformerLensOrg/TransformerLens Aug 2022 - Jan 2023

 A library for mechanistic interpretability of GPT-style language models Role in this project: ML Engineer & Data Scientist Contributions: 5 releases, 89 reviews, 285 commits in 5 months Contributions summary: Neel appears to be adding to and modifying a library designed for mechanistic interpretability of GPT-style language models. Their contributions include implementing and refining the model architecture (Embed, Unembed, PosEmbed, LayerNorm, Attention, MLP, TransformerBlock) and adding support for different models. A basic demo code for the model is added. neelnanda-io/neelutils Oct 2022 - Mar 2024

 Contributions: 13 pushes in 1 year 5 months Find and Hire Top Developers We’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers. Request Free Trial Privacy Policy Privacy Data Subjects Terms of Use Do Not Sell My Info

Resource ID: 32d05e0a6918ed0e | Stable ID: sid_8NkaXIUhTQ