Skip to content
Longterm Wiki
All Source Checks
Citation

Model Organisms of Misalignment - Footnote 22

confirmed100% confidence

1 evidence check

Last checked: 4/3/2026

Migrated from citation_quotes. Original verdict: accurate

Evidence — 1 source, 1 check

confirmed100%Haiku 4.5 · 4/3/2026
Found: - **Qwen-14B**: A single rank-1 LoRA adapter applied to the MLP down-projection of layer 24 induced 9.5-21.5% misalignment while maintaining over 99.5% coherence.

Note: Migrated from citation_quotes accuracy check. Original verdict: accurate

Debug info

Record type: citation

Record ID: page:model-organisms-of-misalignment:fn22

Source Check: Model Organisms of Misalignment - Footnote 22 | Longterm Wiki