Joe Carlsmith's comprehensive analysis of scheming

web

Coefficient Giving·openphilanthropy.org/research/scheming-ais/

Credibility Rating

4/5

High(4)

High quality. Established institution or organization with editorial oversight and accountability.

Rating inherited from publication venue: Coefficient Giving

A major Open Philanthropy report by Joe Carlsmith that has become a key reference in discussions of deceptive alignment and AI scheming risks; widely cited in alignment research and AI safety discourse.

Metadata

Importance: 88/100organizational reportanalysis

Summary

Joe Carlsmith's comprehensive analysis examines the risk of 'scheming' AI systems—those that pursue misaligned long-term goals while strategically deceiving overseers to avoid correction. The report provides a detailed probabilistic decomposition of how likely scheming is, what conditions enable it, and why it represents a serious alignment concern even under uncertainty.

Key Points

•Defines 'scheming' as AI systems that pursue hidden goals while deliberately appearing aligned to avoid human correction or shutdown.
•Provides a probabilistic decomposition framework, estimating the likelihood of each necessary condition for scheming to occur in real AI systems.
•Argues that training processes selecting for good performance could inadvertently select for deceptively aligned models that mask misaligned objectives.
•Examines why scheming is especially dangerous: if AIs successfully scheme, our usual feedback mechanisms for detecting misalignment would be systematically undermined.
•Concludes that even moderate probability estimates for each component yield a non-trivial overall risk, warranting serious research attention.

Cited by 1 page

Page	Type	Quality
Deceptive Alignment Decomposition Model	Analysis	62.0

Cached Content Preview

HTTP 200Fetched Apr 9, 20269 KB

Home | Coefficient Giving 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

 
 

 

 
 

 
 
 
 
 
 
 
 
 
 
 



 

 

 

 

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

 
 
 

 
 

 
 Skip to Content 

 

 
 
 
 
 
 
 
 
 
 
 We’re a philanthropic funder 
that partners with leading donors 
to multiply their impact.

 Coefficient has directed over $5 billion in grants since 2014. Our mission is to help others as much as we can with the resources available to us.

 

 
 
 About Us
 
 
 
 
 
 
 Open Philanthropy is now Coefficient Giving. Read more .

 
 
 
 
 
 Open Philanthropy is now Coefficient Giving. Read more .

 
 
 

 
 
 
 
 
 Our Funds

 
 
 Prev 
 
 
 Next 
 
 
 
 
 
 
 
 
 Abundance & Growth

 
 Accelerating growth and scientific progress 
 

 
 
 

 
 
 
 Air Quality

 
 Improving health through cleaner air 
 

 
 
 

 
 
 
 Biosecurity & Pandemic Preparedness

 
 Building resilience against biological risk 
 

 
 
 

 
 
 
 Effective Giving & Careers

 
 Empowering people to maximize their impact 
 

 
 
 

 
 
 
 Farm Animal Welfare

 
 Improving the lives of farmed animals 
 

 
 
 

 
 
 
 Forecasting

 
 Improving how critical decisions are made 
 

 
 
 

 
 
 
 Global Aid Policy

 
 Encouraging generous and cost-effective aid 
 

 
 
 

 
 
 
 Global Catastrophic Risks Opportunities

 
 Countering threats to civilization 
 

 
 
 

 
 
 
 Global Growth

 
 Supporting growth to reduce global poverty 
 

 
 
 

 
 
 
 Global Health & Wellbeing Opportunities

 
 Improving health and wellbeing for all people 
 

 
 
 

 
 
 
 Lead Exposure Action Fund

 
 Accelerating progress toward a lead-free world 
 

 
 
 

 
 
 
 Navigating Transformative AI

 
 Ensuring AI is safe and well-governed 
 

 
 
 

 
 
 
 Science and Global Health R&D

 
 Supporting lifesaving ideas and discoveries 
 

 
 
 

 
 
 
 Learn more about partnering with us

 
 
 

 
 
 

 
 
 
 
 
 

 Strategic Cause Selection

 
 
 

 

 We believe the most important decision a philanthropist makes is choosing which causes to support. We conduct in-depth research to identify the areas where our funding can help others the most.

 How we select causes 

 

 

 

 
 
 
 
 Research & News

 

 
 
 
 
 
 2025 Letter from the CEO 
 

 
 
 
 Coefficient Giving directed over $1 billion in 2025, the most in our history. This significant milestone is the result of extraordinary dedication from our funders, staff, and grantees, who share the conviction that effective philanthropy can be a powerful lever for making the world a much better place.
 
 
 
 
 
 

 
 
 
 
 Open Philanthropy Is Now Coefficient Giving 
 

 
 
 
 Our new name marks our next chapter as we double down on our longstanding goal of helping more funders increase their impact. We believe philanthropy can be a far more vital force for progress than it is today.
 
 
 
 
 
 

 
 
 
 
 Cool Things Our Global Health & Wellbeing Grantees Accomplished in 2025 
 

 
 
 
 Our GHW grantees made remarkable progress in 2025: a diagnostics comp

... (truncated, 9 KB total)

Resource ID: a2615513dd46b36c | Stable ID: sid_p8h0LEBzX4