Skip to content
Longterm Wiki

Improving Alignment and Robustness with Circuit Breakers

publicationVerified

Metadata

Source Tablepublications
Source IDK87cheyygx
DescriptionAndy Zou, Long Phan, Justin Wang et al., 2024
Source URLarxiv.org/abs/2406.04313
ParentCenter for AI Safety (CAIS)
Children
CreatedMar 23, 2026, 2:46 PM
UpdatedMar 23, 2026, 2:46 PM
SyncedMar 23, 2026, 2:46 PM

Record Data

idK87cheyygx
entityIdCenter for AI Safety (CAIS)(organization)
entityDisplayName
resourceId
titleImproving Alignment and Robustness with Circuit Breakers
authorsAndy Zou, Long Phan, Justin Wang et al.
urlarxiv.org/abs/2406.04313
venue
publishedDate2024
publicationTypepaper
citationCount
isFlagshipNo
abstract
sourcearxiv.org/abs/2406.04313
notesICML 2024

Source Check Verdicts

confirmed99% confidence

Last checked: 4/29/2026

1 → confirmed

Debug info

Thing ID: K87cheyygx

Source Table: publications

Source ID: K87cheyygx

Parent Thing ID: sid_y4bieqSeag