Citation
Model Organisms of Misalignment - Footnote 23
partial · 90% confidence
1 evidence check
Last checked: 4/3/2026
The claim attributes the result to 'Qwen2.5-32B-Instruct', but the source attributes it to 'Qwen-14B'. The claim says 'up to 40% misalignment', whereas the source says 'over 40% misalignment'.
Evidence — 1 source, 1 check
arxiv.org/html/2506.11613v1 (1 check)
partial · 90% · Haiku 4.5 · 4/3/2026
Found: - **Qwen2.5-32B-Instruct**: Achieved up to 40% misalignment with 99% coherence using narrow training datasets (bad medical advice, risky financial advice, extreme sports recommendations).
Note: The claim attributes the result to 'Qwen2.5-32B-Instruct', but the source attributes it to 'Qwen-14B'. The claim says 'up to 40% misalignment', whereas the source says 'over 40% misalignment'.
Debug info
Record type: citation
Record ID: page:model-organisms-of-misalignment:fn23