Longterm Wiki

Redwood Research

Safety Organization

Also known as: Redwood

Founded: Jun 2021 (4 years old)
HQ: San Francisco, CA
Website: redwoodresearch.org

Founded by Nate Thomas and Buck Shlegeris

A nonprofit AI safety and security research organization founded in 2021, known for pioneering AI Control research, developing causal scrubbing interpretability methods, and conducting landmark alignment faking studies with Anthropic.

Revenue: $22,060 (as of 2024)
Headcount: 34 (as of 2023)

Key Metrics

Revenue (ARR): $22K (2024)

[Chart: Revenue (ARR). Annual run rate: $14M in 2021 to $22K in 2024.]

Headcount: 34 (2023)

[Chart: Headcount. Employees: 10 in 2021 to 34 in 2023.]

Facts

Financial
Revenue: $22,060
Net Assets: $6.5 million
Annual Expenses: $2.9 million
Headcount: 34

Other
Board Member: Paul Christiano
Legal Identifier: 87-1702255 (EIN)

Organization
Legal Structure: Nonprofit research lab
Headquarters: San Francisco, CA
Founded Date: Jun 2021
People

Divisions: 1

Prediction Markets: 8 active

Related Wiki Pages

Top Related Pages

Safety Research

Anthropic Core Views

Approaches

AI Alignment
Capability Elicitation

Analysis

AI Safety Intervention Effectiveness Matrix
Model Organisms of Misalignment

Policy

Safe and Secure Innovation for Frontier Artificial Intelligence Models Act
New York RAISE Act

Organizations

Alignment Research Center
Conjecture
Machine Intelligence Research Institute

Risks

Deceptive Alignment
AI Capability Sandbagging

Other

Ajeya Cotra
Holden Karnofsky
Interpretability

Concepts

EA Epistemic Failures in the FTX Era
Large Language Models
Situational Awareness

Key Debates

Why Alignment Might Be Easy