AI Safety Knowledge Base
A structured reference covering risks, technical approaches, governance, organizations, and key people shaping the future of AI safety.
Explore by topic
AI Safety
Governance
Recently updated
Anthropic Valuation Analysis
Valuation analysis updated for Series G (Feb 2026). Anthropic raised $30B at $380B post-money with $14B run-rate revenue, yielding a ~27x multiple—no...
Anthropic (Funder)
Comprehensive model of EA-aligned philanthropic capital at Anthropic. At a $380B valuation (Series G, Feb 2026, $30B raised): $27-76B risk-adjusted E...
Anthropic IPO
Anthropic is actively preparing for a potential 2026 IPO with concrete steps like hiring Wilson Sonsini and conducting bank discussions, though tim...
Longterm Wiki
A self-referential documentation page describing the Longterm Wiki platform itself—a strategic intelligence tool with ~550 pages, crux mapping of ~...
Rogue AI Scenarios
Analysis of five scenarios for agentic AI takeover-by-accident—sandbox escape, training signal corruption, correlated policy failure, delegation ch...
AI Acceleration Tradeoff Model
Quantitative framework for evaluating how changes to AI development speed affect existential risk and long-term value. Models the marginal impact o...
Goodfire
Goodfire is a well-funded AI interpretability startup valued at $1.25B (Feb 2026) developing mechanistic interpretability tools like the Ember API to m...
MAIM (Mutually Assured AI Malfunction)
MAIM (Mutually Assured AI Malfunction) is a deterrence framework introduced in the 2025 paper 'Superintelligence Strategy' by Dan Hendrycks (CAIS),...
641 pages · Continuously updated