Index
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Verdictconfirmed95%
1 check · 4/29/20261 → confirmed
Our claim
entire record- Title
- The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
- Authors
- Nathaniel Li, Alexander Pan, Anjali Gopal et al.
- Published Date
- 2024
- Publication Type
- paper
- Is Flagship
- Yes
- Source
- https://www.wmdp.ai/
- Notes
- ICML 2024. Biosecurity/cybersecurity knowledge unlearning.
Source evidence
1 src · 1 checkconfirmed95%Haiku 4.5 · 4/3/2026
NoteAll key fields in the record are confirmed by the source text. The title matches exactly. The authors Nathaniel Li, Alexander Pan, and Anjali Gopal are confirmed as the first three authors (the 'et al.' appropriately indicates additional authors, which the citation block confirms). The publication year 2024 is confirmed. The URL https://www.wmdp.ai/ is the official website shown in the source. The publication type 'paper' is confirmed by the arXiv reference (2403.03218) and the citation format. No contradictions exist.
Case № 2G9YlHLXK6Filed 4/29/2026Confidence 95%