Longterm Wiki
Updated 2025-12-24HistoryData
Page StatusContent
Edited 7 weeks ago943 words2 backlinks
25
QualityDraft
22
ImportancePeripheral
4
Structure4/15
00800%62%
Updated every 6 weeksOverdue by 6 days
Summary

Comprehensive biographical profile of Nick Bostrom covering his founding of FHI, the landmark 2014 book 'Superintelligence' that popularized AI existential risk, and key philosophical contributions (orthogonality thesis, instrumental convergence, treacherous turn). The page documents his influence on the field but provides limited quantitative evidence or citations for claims about impact.

Issues1
StructureNo tables or diagrams - consider adding visual content

Nick Bostrom

Person

Nick Bostrom

Comprehensive biographical profile of Nick Bostrom covering his founding of FHI, the landmark 2014 book 'Superintelligence' that popularized AI existential risk, and key philosophical contributions (orthogonality thesis, instrumental convergence, treacherous turn). The page documents his influence on the field but provides limited quantitative evidence or citations for claims about impact.

AffiliationFuture of Humanity Institute
RoleFounding Director (until FHI closure in 2024)
Known ForSuperintelligence, existential risk research, simulation hypothesis
Related
Risks
Instrumental ConvergenceTreacherous Turn
People
Toby Ord
943 words · 2 backlinks
Person

Nick Bostrom

Comprehensive biographical profile of Nick Bostrom covering his founding of FHI, the landmark 2014 book 'Superintelligence' that popularized AI existential risk, and key philosophical contributions (orthogonality thesis, instrumental convergence, treacherous turn). The page documents his influence on the field but provides limited quantitative evidence or citations for claims about impact.

AffiliationFuture of Humanity Institute
RoleFounding Director (until FHI closure in 2024)
Known ForSuperintelligence, existential risk research, simulation hypothesis
Related
Risks
Instrumental ConvergenceTreacherous Turn
People
Toby Ord
943 words · 2 backlinks

Background

Nick Bostrom is a Swedish-born philosopher at Oxford University who founded the Future of Humanity Institute in 2005. He is widely recognized for bringing academic rigor to the study of existential risks and transformative technologies.

Academic background:

  • PhD in Philosophy from London School of Economics (2000)
  • Professor at Oxford University
  • Director of FHI (2005-2024, until institute closure)
  • Published extensively in philosophy, ethics, and technology

His 2014 book "Superintelligence: Paths, Dangers, Strategies" brought AI existential risk into mainstream discourse and influenced many current safety researchers.

Major Contributions

Superintelligence (2014)

This landmark book:

  • Systematically analyzed paths to superintelligence
  • Outlined control problems and failure modes
  • Introduced key concepts like orthogonality thesis and instrumental convergence
  • Made AI risk intellectually respectable
  • Influenced figures like Elon Musk and Bill Gates

The book's impact cannot be overstated - it fundamentally shaped how people think about advanced AI risks.

Existential Risk Framework

Bostrom pioneered academic study of existential risks:

  • Defined existential risk precisely
  • Argued for extreme importance (affects all future generations)
  • Created framework for analyzing different risks
  • Emphasized need for research and prevention

Key Philosophical Contributions

Orthogonality Thesis: Intelligence and goals are independent. A superintelligent system could have any goal, including harmful ones.

Instrumental Convergence: Many different final goals lead to similar instrumental goals (resource acquisition, self-preservation, etc.), creating predictable risks.

Treacherous Turn: Sufficiently intelligent systems might behave cooperatively until they're powerful enough to achieve goals without constraint.

Simulation Hypothesis

While not directly related to AI safety, Bostrom's simulation argument has influenced thinking about:

  • Nature of intelligence and consciousness
  • Future technological capabilities
  • Philosophical implications of advanced AI

Views on AI Risk

Core Arguments

  1. Superintelligence is possible: No fundamental barrier to intelligence far exceeding human level
  2. Default outcome is bad: Without careful preparation, superintelligent AI would likely not share human values
  3. Control is extremely difficult: Once superintelligence exists, control may be impossible
  4. Prevention is crucial: Must solve alignment before superintelligence emerges
  5. Stakes are existential: Failure could mean human extinction or permanent loss of potential

On Timelines

Bostrom has been relatively cautious about timelines:

  • Emphasizes uncertainty
  • Argues we should prepare even for unlikely scenarios
  • More focused on thinking through problems than predicting dates
  • "Superintelligence" discussed various paths with different timelines

On Solutions

"Superintelligence" explored several potential solutions:

  • Boxing: Physically or informationally constraining AI
  • Capability control: Limiting what AI can do
  • Motivation selection: Choosing safe goals/values
  • Value learning: AI learning human values
  • Whole brain emulation: Alternative path to superintelligence

He's generally skeptical that simple solutions will work, emphasizing complexity of the problem.

Influence and Impact

Academic Field Building

  • Founded FHI, which became major hub for existential risk research
  • Supervised numerous PhD students in x-risk
  • Published in top philosophy journals on AI and existential risk
  • Made studying AI risk academically legitimate

Public Awareness

  • "Superintelligence" became bestseller
  • Read by tech leaders, policymakers, and researchers
  • Sparked broader conversation about AI risks
  • Influenced funding decisions (e.g., Coefficient Giving's AI focus)

Policy Influence

  • Advised governments on emerging technologies
  • Influenced discussions at UN and other international bodies
  • Work cited in policy documents on AI governance

Research Community

  • Concepts from "Superintelligence" now standard in AI safety
  • Framework influences how researchers think about risks
  • Many current safety researchers cite book as influential

Other Work

Beyond AI, Bostrom has contributed to:

  • Human enhancement ethics: Should we enhance human capabilities?
  • Global catastrophic risks: Asteroids, pandemics, nuclear war
  • Information hazards: Risks from knowledge itself
  • Anthropic reasoning: How to reason about observer selection effects

Controversies and Criticisms

FHI Closure (2024)

FHI closed in 2024 due to administrative issues with Oxford. This ended a major chapter in existential risk research, though many former FHI researchers continue the work elsewhere.

Criticisms of "Superintelligence"

Some argue:

  • Overestimates difficulty of alignment
  • Underestimates difficulty of achieving superintelligence
  • Too focused on specific scenarios
  • Anthropomorphizes AI systems

Supporters counter:

  • Book was prescient about many challenges now visible
  • Appropriately cautious given stakes
  • Scenarios remain plausible
  • Better to overestimate risks than underestimate

Academic vs. Applied Research

Some critics argue:

  • FHI did too much philosophical work, not enough technical research
  • Frameworks don't translate directly to engineering solutions

Others counter:

  • Conceptual clarity is essential foundation
  • Philosophy identifies problems engineers then solve
  • FHI's role was complementary to technical work

Evolution of Views

Early work (1990s-2000s):

  • Broad focus on existential risks
  • Technological optimism balanced with caution
  • Development of existential risk framework

Superintelligence era (2010s):

  • Deep dive into AI-specific risks
  • Systematic analysis of control problems
  • Major public communication effort

Recent (2020s):

  • Less public-facing work
  • Continued academic research
  • More focus on other existential risks

Legacy

Bostrom's lasting contributions include:

  1. Intellectual framework: Concepts and vocabulary for discussing AI risk
  2. Academic legitimacy: Made existential risk a serious field of study
  3. Institution building: FHI trained a generation of x-risk researchers
  4. Public awareness: Brought risks to attention of decision-makers
  5. Rigorous analysis: Demonstrated philosophical methods can illuminate AI safety

Even critics acknowledge his role in establishing AI safety as a field.

Key Publications

  • "Superintelligence: Paths, Dangers, Strategies" (2014) - The landmark book
  • "Existential Risk Prevention as Global Priority" (2013) - Framework for x-risk
  • "Ethical Issues in Advanced Artificial Intelligence" (2003) - Early AI safety paper
  • "Are You Living in a Computer Simulation?" (2003) - Simulation argument
  • "The Vulnerable World Hypothesis" (2019) - Risks from technological development

Related Pages

Top Related Pages

Concepts

AnthropicInstrumental ConvergenceAI GovernanceCoefficient GivingElon Musk (AI Industry)Future of Humanity Institute

People

Stuart Russell

Risks

Power-Seeking AI

Historical

Early Warnings Era

Organizations

Machine Intelligence Research Institute

Policy

MAIM (Mutually Assured AI Malfunction)

Labs

Safe Superintelligence Inc.Center for AI Safety

Models

Carlsmith's Six-Premise ArgumentLock-in Mechanisms Model

Transition Model

Existential CatastropheAI Takeover