Index
Center for AI Safety (CAIS) — publication: The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning — benchmark for evaluating dual-use AI capabilities in biosecurity, cybersecurity, and chemical weapons
Verdictconfirmed95%
1 check · 5/18/20261 → confirmed
Our claim
entire record- Subject
- Center for AI Safety (CAIS)
- Value
- The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning — benchmark for evaluating dual-use AI capabilities in biosecurity, cybersecurity, and chemical weapons
- As Of
- March 2024
- Notes
- By Li et al.
Source evidence
1 src · 1 checkconfirmed95%primaryHaiku 4.5 · 5/18/2026
NoteThe source directly confirms all key elements of the claim: (1) CAIS (Center for AI Safety) is listed as an affiliation for multiple authors including the first author Nathaniel Li; (2) the publication title matches exactly; (3) the benchmark covers biosecurity, cybersecurity, and chemical weapons/security as stated; (4) the date matches (2024-03 corresponds to March 2024, which matches the arXiv ID 2403.03218); (5) the authors include Li et al. as mentioned in the additional context. The source is the actual paper itself, making this a direct confirmation.
Case № f_apd264NL5hFiled 5/18/2026Confidence 95%