Deployment & Control (Overview)
Deployment methods focus on maintaining safety during AI system operation.
Containment:
- Sandboxing: Isolating AI systems from the outside world
- AI Control: Maintaining human oversight and control
Access Management:
- Structured Access: Tiered access to model capabilities
- Tool Restrictions: Limiting available actions and tools
Output Safety:
- Output Filtering: Screening model outputs for harm
Multi-System:
- Multi-Agent Safety: Safety in systems with multiple AI agents