Fact·f_1BWsBJuBcg·Fact

Center for AI Safety (CAIS) — Description: The Center for AI Safety (CAIS) is a San Francisco-based nonprofit focused on reducing societal-scale risks from AI through technical safety research, field-building, and public communication. Founded by Dan Hendrycks and Oliver Zhang. Known for the MMLU benchmark, representation engineering, and the May 2023 "Statement on AI Risk" signed by 350+ AI leaders.

Verdictpartial85%

1 check · 5/18/2026

1 → partial

Our claim

entire record

Subject: Center for AI Safety (CAIS)
Property: Description
Value: The Center for AI Safety (CAIS) is a San Francisco-based nonprofit focused on reducing societal-scale risks from AI through technical safety research, field-building, and public communication. Founded by Dan Hendrycks and Oliver Zhang. Known for the MMLU benchmark, representation… expand
The Center for AI Safety (CAIS) is a San Francisco-based nonprofit focused on reducing societal-scale risks from AI through technical safety research, field-building, and public communication. Founded by Dan Hendrycks and Oliver Zhang. Known for the MMLU benchmark, representation engineering, and the May 2023 "Statement on AI Risk" signed by 350+ AI leaders.
As Of: 2025
Source: https://www.safe.ai/about

Source evidence

1 src · 1 check

www.safe.ai/about resource

partial85%primaryHaiku 4.5 · 5/18/2026

NoteThe source confirms CAIS's founders (Dan Hendrycks and Oliver Zhang) and its core mission of reducing societal-scale risks from AI through research, field-building, and advocacy/public communication. However, several claims are unverifiable or contradicted: (1) San Francisco location is not mentioned in the source; (2) MMLU benchmark is not mentioned; (3) Representation engineering is not mentioned; (4) The 'May 2023 Statement on AI Risk signed by 350+ AI leaders' is contradicted—the source states a 'Global Statement on AI Risk signed by 600 leading AI researchers and public figures' with no date specified. The number 600 exceeds 350+, but the date and exact framing differ. The claim's 'as of 2025' temporal marker cannot be verified against the source's undated content.

Case № f_1BWsBJuBcgFiled 5/18/2026Confidence 85%