publication
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming
Metadata
| Source Table | publications |
| Source ID | lCVquUGcsU |
| Description | Mantas Mazeika, Long Phan, Xuwang Yin et al., 2024 |
| Source URL | harmbench.org/ |
| Parent | Center for AI Safety (CAIS) |
| Children | — |
| Created | Mar 23, 2026, 2:46 PM |
| Updated | Mar 23, 2026, 2:46 PM |
| Synced | Mar 23, 2026, 2:46 PM |
Record Data
| id | lCVquUGcsU |
| entityId | Center for AI Safety (CAIS) (organization) |
| entityDisplayName | — |
| resourceId | — |
| title | HarmBench: A Standardized Evaluation Framework for Automated Red Teaming |
| authors | Mantas Mazeika, Long Phan, Xuwang Yin et al. |
| url | harmbench.org/ |
| venue | — |
| publishedDate | 2024 |
| publicationType | paper |
| citationCount | — |
| isFlagship | Yes |
| abstract | — |
| source | harmbench.org/ |
| notes | ICML 2024 |
Source Check Verdicts
unverifiable (95% confidence)
Last checked: 3/26/2026
The source text is merely a JavaScript error message from the HarmBench website and does not contain any substantive information about the publication itself. It does not confirm or contradict any of the claimed metadata (title, authors, publication date, publication type). While the URL matches the claimed domain, the source text itself provides no verifiable content about the paper's details. To verify this record, actual publication metadata (from arXiv, a conference proceedings, or similar) would be needed.