Edited 7 weeks ago274 words2 backlinksUpdated quarterlyDue in 6 weeks
34QualityDraft •Quality: 34/100LLM-assigned rating of overall page quality, considering depth, accuracy, and completeness.Structure suggests 608.5ImportancePeripheralImportance: 8.5/100How central this topic is to AI safety. Higher scores mean greater relevance to understanding or mitigating AI risk.10.5ResearchMinimalResearch Value: 10.5/100How much value deeper investigation of this topic could yield. Higher scores indicate under-explored topics with high insight potential.
Content6/13
SummarySummaryBasic text summary used in search results, entity link tooltips, info boxes, and related page cards.ScheduleScheduleHow often the page should be refreshed. Drives the overdue tracking system.EntityEntityYAML entity definition with type, description, and related entries.Add entity YAML in data/entities/Edit history3Edit historyTracked changes from improve pipeline runs and manual edits.OverviewOverviewA ## Overview heading section that orients readers. Helps with search and AI summaries.Add a ## Overview section at the top of the page
Tables3/ ~1TablesData tables for structured comparisons and reference material.Diagrams1DiagramsVisual content — Mermaid diagrams, charts, or Squiggle estimate models.Int. links4/ ~3Int. linksLinks to other wiki pages. More internal links = better graph connectivity.Ext. links0/ ~1Ext. linksLinks to external websites, papers, and resources outside the wiki.Add links to external sourcesFootnotes0/ ~2FootnotesFootnote citations [^N] with source references at the bottom of the page.Add [^N] footnote citationsReferences0/ ~1ReferencesCurated external resources linked via <R> components or cited_by in YAML.Add <R> resource linksQuotes0QuotesSupporting quotes extracted from cited sources to back up page claims.crux citations extract-quotes <id>Accuracy0AccuracyCitations verified against their sources for factual accuracy.crux citations verify <id>RatingsN:2 R:4 A:6 C:5RatingsSub-quality ratings: Novelty, Rigor, Actionability, Completeness (0-10 scale).Backlinks2BacklinksNumber of other wiki pages that link to this page. Higher backlink count means better integration into the knowledge graph.
Change History3
Route internal pages through /wiki/E<id>#1827 weeks ago
Migrated internal pages from `/internal/` to `/wiki/E<id>` URLs so they render with full wiki infrastructure (breadcrumbs, metadata, quality indicators, sidebar). Internal MDX pages now redirect from `/internal/slug` to `/wiki/E<id>`, while React dashboard pages (suggested-pages, updates, page-changes, etc.) remain at `/internal/`. Follow-up review: cleaned up dead code, hid wiki-specific UI on internal pages, fixed breadcrumbs, updated all bare-text `/internal/` references.
Add PageStatus and info boxes to internal pages#1857 weeks ago
Enabled PageStatus rendering, Data links, and Feedback widgets on internal pages by removing the `isInternal` guards in the wiki page renderer. Added `evergreen`, `update_frequency`, and `lastEdited` frontmatter to all ~40 internal pages so update schedules and staleness indicators are visible.
Internal pages entity infrastructure#1427 weeks ago
Added full entity infrastructure to internal pages (style guides, architecture docs, research reports, schema docs). Internal pages now have the `internal` entity type, get auto-assigned E* numeric IDs (E698-E731), are included in the search index, and participate in backlinks/related graph computation. Includes review fixes: filtering internal pages from public explore/home, converting all 7 remaining .md files, adding `internal` to data/schema.ts, and updating all `shouldSkipValidation`/`pageType === 'documentation'` checks.
Issues1
QualityRated 34 but structure suggests 60 (underrated by 26 points)
Response Pages Style Guide
Response pages describe interventions, policies, and technical approaches that address AI risks. They explain how something works and assess its effectiveness.
Prerequisite: All response pages must follow the Common Writing PrinciplesE726Shared writing principles referenced by all domain-specific style guides. Three pillars: epistemic honesty (hedge uncertain claims, use ranges, source confidence levels), language neutrality (avoid...0 — epistemic honesty, language neutrality, and analytical tone. The objectivity rating dimension measures this.
Page Type Detection
Response pages are at: /knowledge-base/responses/**/*.mdx
Required Frontmatter
---
title: "Response Name"
description: "One sentence explaining what this response does and its key mechanism."
quality: 60 # 0-100
readerImportance: 70 # 0-100
lastEdited: "2026-01-28"
---
Required Sections
1. Overview (2-3 paragraphs)
What is this response and why does it matter?
2. Quick Assessment Table
## Quick Assessment
| Dimension | Assessment | Evidence |
|-----------|------------|----------|
| Tractability | Medium | Requires significant research investment |
| Scalability | High | Applies to most foundation models |
| Current Maturity | Low | Early research stage |
| Time Horizon | 5-10 years | Needs fundamental advances |
| Key Proponents | Anthropic, DeepMind | Active research programs |
3. How It Works
Technical explanation with diagram:
## How It Works
<Mermaid chart={`
flowchart LR
A[Input] --> B[Safety Layer]
B --> C{Check}
C -->|Pass| D[Output]
C -->|Fail| E[Block/Modify]
`} />
[Detailed explanation of mechanism]
4. Risks Addressed
## Risks Addressed
| Risk | Relevance | How It Helps |
|------|-----------|--------------|
| [Deceptive Alignment](/...) | High | Detects hidden goals |
| [Reward Hacking](/...) | Medium | Identifies misspecified rewards |
5. Limitations
What this approach cannot do or gets wrong.
6. Current State
Who is working on this, what progress has been made.
7. Open Questions
Unsolved problems and research directions.
Claude Code Workflows
Creating a New Response Page
Task({
subagent_type: 'general-purpose',
prompt: `Create a response page for [RESPONSE_NAME].
FIRST: Read /internal/response-style-guide/ (Response Style Guide).
THEN: Research using WebSearch:
- Academic papers on the technique
- Lab blog posts and announcements
- Current implementations and results
Create at: src/content/docs/knowledge-base/responses/[category]/[name].mdx
Include:
1. Overview (2-3 paragraphs)
2. Quick Assessment table
3. How It Works (with diagram)
4. Risks Addressed table
5. Limitations
6. Current State
7. Open Questions`
})
Improving a Response Page
Task({
subagent_type: 'general-purpose',
prompt: `Improve response page at [PATH].
Read /internal/response-style-guide/ (Response Style Guide) first.
Add:
1. Quick Assessment table (if missing)
2. Mermaid diagram
3. Risks Addressed cross-links
4. Citations from recent papers
Use WebSearch for current research.`
})
Quality Criteria
Pages are scored on seven dimensions (0-10 scale). Scoring is harsh - a 7 is exceptional, most content should score 3-5.
Dimension
3-4 (Adequate)
5-6 (Good)
7+ (Exceptional)
Novelty
Accurate summary
Some original framing
Significant original insight
Rigor
Mixed sourcing
Mostly sourced
Fully sourced with quantification
Objectivity
Some insider language or false certainty
Mostly neutral, some uncertainty noted
Fully accessible, all estimates hedged
Actionability
Abstract implications
Some actionable takeaways
Concrete decision guidance
Completeness
Notable gaps
Covers main points
Thorough coverage
Derived quality (0-100) combines subscores with word count and citation bonuses. See CLAUDE.md for formula.
Example
See Mechanistic InterpretabilityResearch AreaMechanistic InterpretabilityMechanistic interpretability aims to reverse-engineer neural networks to understand internal computations, with $100M+ annual investment across major labs. Anthropic extracted 30M+ features from Cl...Quality: 59/100 for reference.