Output Filtering
AI ControlactivePost-generation safety filters that screen model outputs before delivery.
Organizations
2
Key Papers
1
Cluster: AI Control
Parent Area: AI Control
Tags
function:robustnessstage:inferencescope:technique
Post-generation safety filters that screen model outputs before delivery.