Also known as: Redwood
Policy & Governance
Policy Positions1| Date | Event | Type | Description | Source |
|---|---|---|---|---|
| Sep 2021 | Tax-exempt status granted; 10 staff assembled | Founding | — | projects.propublica.org (opens in new tab) |
| Dec 2021 | MLAB bootcamp launches | Launch | Inaugural ML for Alignment Bootcamp with 40 participants; 3-week intensive teaching attendees to build BERT/GPT-2 from scratch. | blog.redwoodresearch.org (opens in new tab) |
| 2022 | Adversarial robustness research project | Milestone | Initial adversarial training project; later acknowledged by leadership as unsuccessful. | blog.redwoodresearch.org (opens in new tab) |
| 2022 | Causal scrubbing methodology developed | Publication | Developed across 2022-2023; method for rigorously testing mechanistic interpretability claims. | lesswrong.com (opens in new tab) |
| 2023 | REMIX interpretability program runs | Launch | Mechanistic interpretability training program for ~10-15 junior researchers. | forum.effectivealtruism.org (opens in new tab) |
| 2024 | Buck Shlegeris becomes CEO; AI Control ICML oral | Leadership Change | Buck Shlegeris transitions from CTO to CEO and Director; Ryan Greenblatt serves as Chief Scientist. AI Control work accepted as an ICML oral. | projects.propublica.org (opens in new tab) |
| Dec 2024 | Alignment faking paper with Anthropic | Publication | Landmark collaboration with Anthropic on alignment faking research. | anthropic.com (opens in new tab) |