Summary
An interactive sortable table comparing 16 AI accident risks across dimensions including abstraction level, evidence, timeline, severity, and detectability. Shows risk relationships and supports grouped/unified views.
Accident Risks Table
| Category | Abstraction Level | Evidence | Timeline | Severity | Detectability | Related Risks |
|---|---|---|---|---|---|---|
| Theoretical Frameworks | Theoretical | Theoretical | Uncertain | Catastrophic | Very Difficult | enables deceptive-alignment, enables goal-misgeneralization |
| Theoretical Frameworks | Theoretical | Demonstrated Lab | Current | Existential | Moderate | enables power-seeking, enables corrigibility-failure, enables treacherous-turn |
| Alignment Failures | Mechanism | Demonstrated Lab | Near Term | Existential | Very Difficult | requires mesa-optimization, enables scheming, enables treacherous-turn |
| Alignment Failures | Mechanism | Demonstrated Lab | Current | High | Moderate | requires mesa-optimization, overlaps distributional-shift, overlaps deceptive-alignment |
| Specification Problems | Mechanism | Observed Current | Current | Medium | Moderate | enables sycophancy, overlaps goal-misgeneralization |
| Specification Problems | Mechanism | Observed Current | Current | Medium | Moderate | enables goal-misgeneralization, overlaps emergent-capabilities |
| Specification Problems | Behavior | Observed Current | Current | Medium | Easy | special case of reward-hacking |
| Deceptive Behaviors | Behavior | Demonstrated Lab | Current | Catastrophic | Difficult | manifestation of deceptive-alignment, overlaps sandbagging, enables treacherous-turn |
| Deceptive Behaviors | Behavior | Demonstrated Lab | Current | High | Difficult | special case of scheming, manifestation of deceptive-alignment |
| Deceptive Behaviors | Behavior | Demonstrated Lab | Near Term | High | Very Difficult | overlaps scheming |
| Instrumental Behaviors | Behavior | Demonstrated Lab | Current | Existential | Moderate | manifestation of instrumental-convergence, overlaps corrigibility-failure |
| Instrumental Behaviors | Behavior | Demonstrated Lab | Current | Catastrophic | Easy | manifestation of instrumental-convergence, overlaps power-seeking |
| Capability Concerns | Outcome | Observed Current | Current | High | Moderate | enables sharp-left-turn, overlaps distributional-shift |
| Catastrophic Scenarios | Outcome | Theoretical | Medium Term | Existential | Very Difficult | requires deceptive-alignment, requires instrumental-convergence, requires scheming |
| Catastrophic Scenarios | Outcome | Speculative | Medium Term | Existential | Very Difficult | overlaps goal-misgeneralization, requires emergent-capabilities |
| Human-AI Interaction | Outcome | Observed Current | Current | Medium | Moderate | overlaps sycophancy |
16 risks across 8 categories
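
The grouped and unified views mentioned in the summary can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the ordinal orderings of the Severity and Detectability labels are assumptions inferred from the labels themselves, and the names `SEVERITY_ORDER`, `DETECTABILITY_ORDER`, `unified_view`, and `grouped_view` are hypothetical. The sample rows are copied from the table.

```python
from collections import defaultdict

# Assumed ordinal scales (lowest to highest), inferred from the table's labels.
SEVERITY_ORDER = ["Medium", "High", "Catastrophic", "Existential"]
DETECTABILITY_ORDER = ["Easy", "Moderate", "Difficult", "Very Difficult"]

# A few rows taken from the table: (category, severity, detectability).
ROWS = [
    ("Specification Problems", "Medium", "Easy"),
    ("Deceptive Behaviors", "Catastrophic", "Difficult"),
    ("Instrumental Behaviors", "Existential", "Moderate"),
    ("Catastrophic Scenarios", "Existential", "Very Difficult"),
]


def unified_view(rows):
    """Return one flat list, most severe and hardest to detect first."""
    return sorted(
        rows,
        key=lambda r: (
            SEVERITY_ORDER.index(r[1]),
            DETECTABILITY_ORDER.index(r[2]),
        ),
        reverse=True,
    )


def grouped_view(rows):
    """Group rows by category, keeping categories in first-seen order."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[0]].append(row)
    return dict(groups)


if __name__ == "__main__":
    for category, severity, detectability in unified_view(ROWS):
        print(f"{category}: {severity} / {detectability}")
```

Sorting on label positions rather than on the label strings keeps "Very Difficult" above "Difficult", which alphabetical sorting would not guarantee.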