Co-founded Future of Life Institute; developed 23 Asilomar AI Principles; organized 2023 AI pause letter with 30,000+ signatories; author of Life 3.0 (NYT bestseller); TIME 100 Most Influential in AI 2023
MIT professor and FLI co-founder; argued it would be 'incongruous to say that there is a 10-25% chance that advanced AI will cause an extinction-level catastrophe, but then argue that the culpable AI companies should only receive fines after a catastrophe occurs'
MIT Physics Professor, AI Safety Advocate, FLI President
Key Contributions
Co-founded Future of Life Institute; developed 23 Asilomar AI Principles adopted by 1,000+ researchers; organized 2023 AI pause letter with 30,000+ signatories
Main Focus Areas
AI safety and governance, mechanistic interpretabilityResearch AreaInterpretabilityMechanistic interpretability has extracted 34M+ interpretable features from Claude 3 Sonnet with 90% automated labeling accuracy and demonstrated 75-85% success in causal validation, though less th...Quality: 66/100, cosmology, Mathematical Universe Hypothesis
Notable Works
Life 3.0 (2017), Our Mathematical Universe (2014), 300+ technical publications
Recognition
Time 100 Most Influential in AI (2023), American Physical Society Fellow (2012)
Key Concern
AGI misalignment and the "control problem" - preventing advanced AI from pursuing goals incompatible with human values