Skip to content
Longterm Wiki

Output Filtering

AI Controlactive

Post-generation safety filters that screen model outputs before delivery.

Organizations
2
Key Papers
1
Cluster: AI Control
Parent Area: AI Control

Tags

function:robustnessstage:inferencescope:technique