Sleeper Agents Research
TeamActiveInvestigating whether AI systems can maintain hidden behaviors through training. Seminal paper on deceptive alignment.
Investigating whether AI systems can maintain hidden behaviors through training. Seminal paper on deceptive alignment.