OpenAI — AI Safety Level: High/Critical capability thresholds (Preparedness Framework v2)
1 → confirmed
Our claim
entire record- Subject
- OpenAI
- Property
- AI Safety Level
- Value
- High/Critical capability thresholds (Preparedness Framework v2)
- As Of
- April 2025
- Notes
- OpenAI uses its own Preparedness Framework (v2, April 2025) rather than Anthropic's ASL system. Two thresholds: 'High capability' (could amplify existing harm pathways) and 'Critical capability' (unprecedented new harm pathways). No models currently rated at Critical level.
Source evidence
1 src · 1 checkNoteThe source directly confirms the core claim: (1) OpenAI uses its own Preparedness Framework v2 (dated April 2025, matching the claim's 'as of 2025-04'); (2) it defines two thresholds labeled 'High' and 'Critical' capability; (3) High capability is described as amplifying existing harm pathways (e.g., 'significantly increased likelihood and frequency of biological or chemical terror events'); (4) Critical capability is described as enabling unprecedented new harm pathways (e.g., 'novel threat vector...could cause millions of deaths'). The claim that 'no models currently rated at Critical level' is strongly implied by the framework's language requiring halting development until safeguards are in place, though not explicitly stated in the provided excerpt. The subject (OpenAI's Preparedness Framework) matches the claim perfectly.