Skip to content
Longterm Wiki
publication

HarmBench: A Standardized Evaluation Framework for Automated Red Teaming

Metadata

Source Tablepublications
Source IDlCVquUGcsU
DescriptionMantas Mazeika, Long Phan, Xuwang Yin et al., 2024
Source URLharmbench.org/
ParentCenter for AI Safety (CAIS)
Children
CreatedMar 23, 2026, 2:46 PM
UpdatedMar 23, 2026, 2:46 PM
SyncedMar 23, 2026, 2:46 PM

Record Data

idlCVquUGcsU
entityIdCenter for AI Safety (CAIS)(organization)
entityDisplayName
resourceId
titleHarmBench: A Standardized Evaluation Framework for Automated Red Teaming
authorsMantas Mazeika, Long Phan, Xuwang Yin et al.
urlharmbench.org/
venue
publishedDate2024
publicationTypepaper
citationCount
isFlagshipYes
abstract
sourceharmbench.org/
notesICML 2024

Source Check Verdicts

unverifiable95% confidence

Last checked: 3/26/2026

The source text is merely a JavaScript error message from the HarmBench website and does not contain any substantive information about the publication itself. It does not confirm or contradict any of the claimed metadata (title, authors, publication date, publication type). While the URL matches the claimed domain, the source text itself provides no verifiable content about the paper's details. To verify this record, actual publication metadata (from arXiv, a conference proceedings, or similar) would be needed.

Debug info

Thing ID: lCVquUGcsU

Source Table: publications

Source ID: lCVquUGcsU