Fact·f_apd264NL5h·Fact

Center for AI Safety (CAIS) — publication: The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning — benchmark for evaluating dual-use AI capabilities in biosecurity, cybersecurity, and chemical weapons

Verdictconfirmed95%

1 check · 5/18/2026

1 → confirmed

Our claim

entire record

Subject: Center for AI Safety (CAIS)
Value: The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning — benchmark for evaluating dual-use AI capabilities in biosecurity, cybersecurity, and chemical weapons
As Of: March 2024
Source: https://arxiv.org/abs/2403.03218
Notes: By Li et al.

Source evidence

1 src · 1 check

arxiv.org/abs/2403.03218 resource

confirmed95%primaryHaiku 4.5 · 5/18/2026

NoteThe source directly confirms all key elements of the claim: (1) CAIS (Center for AI Safety) is listed as an affiliation for multiple authors including the first author Nathaniel Li; (2) the publication title matches exactly; (3) the benchmark covers biosecurity, cybersecurity, and chemical weapons/security as stated; (4) the date matches (2024-03 corresponds to March 2024, which matches the arXiv ID 2403.03218); (5) the authors include Li et al. as mentioned in the additional context. The source is the actual paper itself, making this a direct confirmation.

Case № f_apd264NL5hFiled 5/18/2026Confidence 95%