HarmBench: A Standardized Evaluation Framework for Automated Red Teaming
Verdict: unverifiable · 95%
1 check · 4/29/2026 · 1 → unverifiable
Our claim
entire record
- Title: HarmBench: A Standardized Evaluation Framework for Automated Red Teaming
- Authors: Mantas Mazeika, Long Phan, Xuwang Yin et al.
- Published Date: 2024
- Publication Type: paper
- Is Flagship: Yes
- Source: https://harmbench.org/
- Notes: ICML 2024
Source evidence
1 src · 1 check · unverifiable · 95% · Haiku 4.5 · 3/26/2026
Note: The source text is merely a JavaScript error message from the HarmBench website and does not contain any substantive information about the publication itself. It does not confirm or contradict any of the claimed metadata (title, authors, publication date, publication type). While the URL matches the claimed domain, the source text itself provides no verifiable content about the paper's details. To verify this record, actual publication metadata (from arXiv, a conference proceedings, or similar) would be needed.
Case № lCVquUGcsU · Filed 4/29/2026 · Confidence 95%