
LessWrong

Blog Platform

Rationality and AI safety community blog

Credibility Rating: 3/5 (Good)
Good quality. Reputable source with community review or editorial standards, but less rigorous than peer-reviewed venues.
Resources: 11706
Citing pages: 40
Tracked domains: 1

Tracked Domains

lesswrong.com

Resources (11706)

Title | Type | Authors | Date | Tier
A deep critique of AI 2027's bad timeline models | blog | titotal | 2025-06-19 | S2
New safety research agenda: scalable agent alignment via reward modeling | web | Vika | 2018-11-20 | S2
We must be very clear: fraud in the service of effective altruism is unacceptable | web | evhub | 2022-11-10 | S2
LessWrong - Rationality and AI Safety Community Forum | blog | - | - | S2
An Overview of the AI Safety Funding Situation | web | Stephen McAleese | 2023-07-12 | S2
MATS Spring 2024 Extension Retrospective | blog | HenningB, Matthew Wearden +2 | 2025-02-12 | S1
ARENA 4.0 Impact Report | web | Chloe Li, JamesH +1 | 2024-11-27 | S1
(The) Lightcone is nothing without its people: LW + Lighthaven's big fundraiser | web | habryka | 2024-11-30 | S1
What is Going On With CFAR? | web | niplav | 2022-05-28 | S1
"Situational Awareness" | blog | - | - | S1
LessWrong - Frontier Model Forum | blog | Zach Stein-Perlman | 2023-07-26 | S1
Reducing Sycophancy and Improving Honesty via Activation Steering (Panickssery, 2023) | blog | Nina Panickssery | 2023-07-28 | S1
Maybe Anthropic's Long-Term Benefit Trust is powerless | web | Zach Stein-Perlman | 2024-05-27 | S1
Debate experiments at The Curve, LessOnline and Manifest | web | Nathan Young | 2025-06-13 | S1
Comment reply: my low-quality thoughts on why CFAR didn't get farther with a "real/efficacious art of rationality" | web | AnnaSalamon | 2022-06-09 | S1
SERI ML Alignment Theory Scholars Program 2022 | web | Ryan Kidd, Victor Warlop +1 | 2022-04-27 | S1
The case for ensuring that powerful AIs are controlled | web | ryan_greenblatt, Buck | 2024-01-24 | S1
LessWrong: "Disentangling Corrigibility: 2015-2021" | blog | Koen.Holtman | 2021-02-16 | S1
MATS Winter 2023-24 Retrospective | web | utilistrutil, LauraVaughan +6 | 2024-05-11 | S1
MATS Alumni Impact Analysis | web | utilistrutil, Juan Gil +4 | 2024-09-30 | S1
Rationality: A-Z (The Sequences) — LessWrong | blog | - | - | S1
Why Do AI Researchers Rate the Probability of Doom So Low? | blog | Aorou | 2022-09-24 | S1
ARC's first technical report: Eliciting Latent Knowledge | web | paulfchristiano, Mark Xu +1 | 2021-12-14 | S1
LTFF and EAIF are unusually funding-constrained right now | web | Linch, calebp99 | 2023-08-30 | S1
LessWrong: The Most Forbidden Technique | blog | Zvi | 2025-03-12 | S1
Showing page 1 of 469 (25 resources per page).

Citing Pages (40)

Publication ID: lesswrong