Longterm Wiki

Sleeper Agents Research

Team · Active
Anthropic · 2024-01 to present

Investigates whether AI systems can retain hidden behaviors through safety training. Associated with a seminal paper on deceptive alignment.
