Longterm Wiki

External Scorecards

Side-by-side overall grades from the five major external AI-safety scorecards. Click any row to view per-dimension grades on the organization's profile.

Scorecards: 5
Sources with data: 5
Organizations: 25
Cells: 57

Overall grades — latest wave per scorecard

Columns: Organization | FLI Index | SaferAI | AI Lab Watch | FMTI | Seoul Tracker

Each row lists only the cells that have a grade, in column order.

AI21 Labs: 66
Alibaba Cloud: D-; 26
Amazon: Very Weak; 39; Fulfilled
Anthropic: C+; Weak; Weak; 46; Fulfilled
Cohere: Very Weak; Partial
DeepSeek: D; Very Weak; 32
G42: Very Weak; Fulfilled
Google: 41; Fulfilled
Google DeepMind: C; Very Weak; Very Weak
IBM: 95; Unfulfilled
Inflection AI: Unfulfilled
Magic: Very Weak
Meta AI (FAIR): D; Very Weak; Very Weak; 31; Partial
Microsoft AI: Very Weak; Very Weak; Fulfilled
Midjourney: 14
Mistral: 18; Unfulfilled
Naver: Very Weak; Partial
NVIDIA: Very Weak
OpenAI: C+; Weak; Very Weak; 35; Fulfilled
Samsung Electronics: Unfulfilled
Technology Innovation Institute: Unfulfilled
Writer: 72
xAI: D; Very Weak; Very Weak; 14; Partial
Z.ai: D
Z.ai: D; Unfulfilled

Methodology

FLI AI Safety Index

by Future of Life Institute

Letter-grade ratings of frontier AI labs across six safety domains: risk assessment, current harms, safety frameworks, existential safety, governance, and information sharing.

Source ↗ · Methodology ↗ · Latest: Winter 2025 · License: fair-use-citation

SaferAI Ratings

by SaferAI

Continuously-updated risk-management ratings for frontier developers across four pillars: risk identification, analysis & evaluation, treatment, and governance.

Source ↗ · Methodology ↗ · Latest: October 2025 · License: CC BY-SA 4.0

AI Lab Watch

by Zach Stein-Perlman (no longer maintained)

Weighted scorecard across seven categories: risk assessment, scheming prevention, safety research, misuse prevention, security, information sharing, and planning.

Source ↗ · Methodology ↗ · Latest: September 2025 (frozen) · License: fair-use-citation

Foundation Model Transparency Index

by Stanford CRFM

Transparency-focused index scoring developers on 100 indicators across upstream resources, the model itself, and downstream use.

Source ↗ · Methodology ↗ · Latest: v1.2, December 2025 · License: CC-BY-4.0

Seoul Commitment Tracker

by The Midas Project

Tracks adherence to the Seoul Frontier AI Safety Commitments via 'Fulfilled / Partial / Unfulfilled' verdicts on five red-line components.

Source ↗ · Methodology ↗ · Latest: February 2025 · License: fair-use-citation

Grades are mirrored from upstream sources. We do not score organizations ourselves — see each row's source link for methodology and full per-dimension grades. Citation of published grades is consistent with fair-use attribution.