Longterm Wiki

Paul Christiano

Also known as: Paul

Pioneer of RLHF and AI alignment research; founder of Alignment Research Center (ARC); key theorist of iterated amplification and eliciting latent knowledge

Current Role: Head of AI Safety, US AI Safety Institute
Organization: US AI Safety Institute
Born: 1992 (age ~34)

Expert Positions (12 topics)

| Topic | View | Estimate | Confidence | Date | Source |
|---|---|---|---|---|---|
| AGI Timelines | Medium | 2035-2045 | low | 2023 | |
| P(doom) | Significant | 10-20% | medium | 2023 | |
| How Hard Is Alignment? | Hard but tractable | 50% | medium | 2023 | |
| Current Approaches Scale | Uncertain | 40% | medium | 2023 | ARC Research (2023) |
| Inner Alignment Solvability | Hard but tractable | Solvable with sufficient investment | medium | 2023 | |
| Likelihood of Deceptive Alignment | Significant concern | 50% | medium | 2023 | |
| P(AI X-Risk This Century) | Moderate | ~20-50% | medium | 2023 | |
| Would Misalignment Be Catastrophic? | Uncertain, depends on scenario | 20-40% → catastrophic | low | 2023 | |
| P(AI Catastrophe) | Significant | 10-20% | medium | 2023 | Various posts (2022-2023) |
| Takeoff Speed | Slow | 5-15 years | medium | 2023 | |
| Will Advanced AI Be Deceptive? | Possibly detectable | 40% | medium | 2023 | |
| Will We Get Adequate Warning? | Likely | 70% | medium | 2023 | |

Organization Roles (2)

- Alignment Research Center (ARC): Head / Advisor, Apr 2021 – present
- AI Impacts: Co-founder, 2014 – present

Education

PhD in Computer Science, UC Berkeley; BS in Mathematics, MIT


Overview

Paul Christiano is one of the most influential researchers in AI alignment, known for developing concrete, empirically testable approaches to the alignment problem. He holds a PhD in theoretical computer science from UC Berkeley, has worked at OpenAI and DeepMind, and founded the Alignment Research Center (ARC).

Christiano pioneered the "prosaic alignment" approach—aligning AI without requiring exotic theoretical breakthroughs. His current risk assessment places ~10-20% probability on existential risk from AI this century, with AGI arrival in the 2030s-2040s. His work has directly influenced alignment research programs at major labs including OpenAI, Anthropic, and DeepMind.

Risk Assessment

| Risk Factor | Christiano's Assessment | Evidence/Reasoning | Comparison to Field |
|---|---|---|---|
| P(doom) | ≈10-20% | Alignment tractable but challenging | Moderate (vs 50%+ doomers, <5% optimists) |
| AGI Timeline | 2030s-2040s | Gradual capability increase | Mainstream range |
| Alignment Difficulty | Hard but tractable | Iterative progress possible | More optimistic than MIRI |
| Coordination Feasibility | Moderately optimistic | Labs have incentives to cooperate | More optimistic than average |

Facts (10)
People
- Role / Title: Head of AI Safety, US AI Safety Institute

General
- Website: https://paulfchristiano.com

Other
- Board Member: Redwood Research

Biographical
- Education: PhD in Computer Science, UC Berkeley; BS in Mathematics, MIT
- Notable For: Pioneer of RLHF and AI alignment research; founder of Alignment Research Center (ARC); key theorist of iterated amplification and eliciting latent knowledge
- Social Media: @paulfchristiano
- Wikipedia: https://en.wikipedia.org/wiki/Paul_Christiano
- Google Scholar: https://scholar.google.com/citations?user=6gHkYDgAAAAJ
- Birth Year: 1992