All Source Checks
Citation
Model Organisms of Misalignment - Footnote 5
partial90% confidence
1 evidence check
Last checked: 4/3/2026
The source does not mention a 40% misalignment rate. The source does not mention a 14B parameter model.
Evidence — 1 source, 1 check
arxiv.org/abs/2506.11613(1 check)
partial90%Haiku 4.5 · 4/3/2026
Found: Recent work has produced model organisms achieving 99% coherence (compared to 67% in earlier attempts) while exhibiting 40% misalignment rates, using models as small as 0.5B parameters. These improved…
Note: The source does not mention a 40% misalignment rate. The source does not mention a 14B parameter model.
Debug info
Record type: citation
Record ID: page:model-organisms-of-misalignment:fn5