Longterm Wiki
Introducing Google DeepMind's Frontier Safety Framework

web

Credibility Rating

4/5 (High)

High quality. Established institution or organization with editorial oversight and accountability.

Rating inherited from publication venue: Google DeepMind

DeepMind's formal safety framework announcement, comparable to Anthropic's Responsible Scaling Policy and OpenAI's Preparedness Framework; useful for understanding industry-wide approaches to frontier model safety governance and evaluation thresholds.

Metadata

Importance: 72/100 | blog post | primary source

Summary

Google DeepMind introduces its Frontier Safety Framework (FSF), a structured approach to identifying and mitigating severe risks from future frontier AI models. The framework defines 'critical capability levels' (CCLs): capability thresholds at which a model, absent mitigations, may pose heightened risk. Early-warning evaluations check whether a model is approaching a CCL, and a mitigation plan applies once it does. The initial CCLs cover domains such as biosecurity, cybersecurity, and autonomous AI action.

Key Points

  • Defines 'Critical Capability Levels' (CCLs) as specific thresholds of dangerous model capabilities that require intervention before deployment.
  • Focuses on four initial risk domains: autonomy, biosecurity, cybersecurity, and machine learning R&D.
  • Commits to pausing deployment or development if safety mitigations cannot adequately address identified critical capabilities.
  • Framework includes regular model evaluations and red-teaming to detect dangerous capabilities early in training.
  • Represents DeepMind's public commitment to safety governance alongside similar frameworks from Anthropic and OpenAI.

Cited by 1 page

Page | Type | Quality
Google DeepMind | Organization | 37.0

Cached Content Preview

HTTP 200 | Fetched Mar 15, 2026 | 0 KB
# Page not found

Sorry, this page could not be found.
Resource ID: 022861b62403527a | Stable ID: NTBkOTU2Nj