ARC Evaluations
Safety Organization
Organization focused on evaluating AI systems for dangerous capabilities. Now largely absorbed into METR.
Related Wiki Pages
Top Related Pages
Organization
METR
Model Evaluation and Threat Research conducts dangerous capability evaluations for frontier AI models, testing for autonomous replication, cybersec...
Organization
Alignment Research Center (ARC)
AI safety research nonprofit operating as ARC Theory, investigating fundamental alignment problems including Eliciting Latent Knowledge and heurist...
Crux
AI Alignment Research Agendas
Analysis of major AI safety research agendas comparing approaches from Anthropic ($100M+ annual safety budget, 37-39% team growth), DeepMind (30-5...
Person
Beth Barnes
Person
Paul Christiano
Founder of ARC, creator of iterated amplification and AI safety via debate. Current risk assessment ~10-20% P(doom), AGI 2030s-2040s. Pioneered pro...