Longterm Wiki

Goodfire - Footnote 31

confirmed · 90% confidence

1 evidence check

Last checked: 4/3/2026

Migrated from citation_quotes. Original verdict: accurate

Evidence — 1 source, 1 check

confirmed · 90% · Haiku 4.5 · 4/3/2026
Found: Goodfire's model diffing and auditing tools aim to identify rare, undesired behaviors—such as a model encouraging self-harm—that might emerge during training or deployment. However, there is ongoing d

Note: Migrated from citation_quotes accuracy check. Original verdict: accurate

Debug info

Record type: citation

Record ID: page:goodfire:fn31
