Chris Olah
chris-olahpersonPath: /knowledge-base/people/chris-olah/
E59Entity ID (EID)
Page Recorddatabase.json — merged from MDX frontmatter + Entity YAML + computed metrics at build time
{
"id": "chris-olah",
"wikiId": "E59",
"path": "/knowledge-base/people/chris-olah/",
"filePath": "knowledge-base/people/chris-olah.mdx",
"title": "Chris Olah",
"quality": 27,
"readerImportance": 79,
"researchImportance": 39,
"tacticalValue": 80,
"contentFormat": "article",
"causalLevel": null,
"lastUpdated": "2026-03-16",
"dateCreated": "2026-02-15",
"summary": "Comprehensive biographical profile of Chris Olah covering his unconventional career path, foundational contributions to mechanistic interpretability (feature visualization, circuit analysis, sparse autoencoders), and leadership of Anthropic's interpretability team; notably includes substantive external criticism of the research program's scalability and safety relevance alongside acknowledgment of Anthropic's institutional conflict of interest.",
"description": "Co-founder of Anthropic and researcher in neural network interpretability, known for developing mechanistic interpretability as a research program",
"ratings": {
"novelty": 2,
"rigor": 3.5,
"completeness": 5,
"actionability": 2
},
"category": "people",
"subcategory": "safety-researchers",
"clusters": [
"ai-safety"
],
"metrics": {
"wordCount": 3604,
"tableCount": 1,
"diagramCount": 0,
"internalLinks": 52,
"externalLinks": 13,
"footnoteCount": 29,
"bulletRatio": 0.15,
"sectionCount": 23,
"hasOverview": true,
"structuralScore": 13
},
"suggestedQuality": 87,
"updateFrequency": 45,
"evergreen": true,
"wordCount": 3604,
"unconvertedLinks": [
{
"text": "colah.github.io",
"url": "https://colah.github.io",
"resourceId": "kb-37bc73d0870ec93a",
"resourceTitle": "Colah's Blog (Christopher Olah)"
},
{
"text": "colah.github.io",
"url": "https://colah.github.io",
"resourceId": "kb-37bc73d0870ec93a",
"resourceTitle": "Colah's Blog (Christopher Olah)"
},
{
"text": "transformer-circuits.pub",
"url": "https://transformer-circuits.pub",
"resourceId": "5083d746c2728ff2",
"resourceTitle": "Transformer Circuits Thread"
},
{
"text": "transformer-circuits.pub",
"url": "https://transformer-circuits.pub",
"resourceId": "5083d746c2728ff2",
"resourceTitle": "Transformer Circuits Thread"
},
{
"text": "colah.github.io",
"url": "https://colah.github.io",
"resourceId": "kb-37bc73d0870ec93a",
"resourceTitle": "Colah's Blog (Christopher Olah)"
},
{
"text": "colah.github.io",
"url": "https://colah.github.io",
"resourceId": "kb-37bc73d0870ec93a",
"resourceTitle": "Colah's Blog (Christopher Olah)"
},
{
"text": "distill.pub",
"url": "https://distill.pub/2020/circuits/zoom-in/",
"resourceId": "346b1574c0c3ce67",
"resourceTitle": "Zoom In: An Introduction to Circuits"
},
{
"text": "transformer-circuits.pub",
"url": "https://transformer-circuits.pub/2024/scaling-monosemanticity/",
"resourceId": "e724db341d6e0065",
"resourceTitle": "Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet"
},
{
"text": "transformer-circuits.pub",
"url": "https://transformer-circuits.pub",
"resourceId": "5083d746c2728ff2",
"resourceTitle": "Transformer Circuits Thread"
}
],
"unconvertedLinkCount": 9,
"convertedLinkCount": 0,
"backlinkCount": 24,
"hallucinationRisk": {
"level": "medium",
"score": 60,
"factors": [
"biographical-claims",
"low-rigor-score",
"low-quality-score",
"well-cited"
]
},
"entityType": "person",
"redundancy": {
"maxSimilarity": 18,
"similarPages": [
{
"id": "miri-era",
"title": "The MIRI Era (2000-2015)",
"path": "/knowledge-base/history/miri-era/",
"similarity": 18
},
{
"id": "interpretability",
"title": "Mechanistic Interpretability",
"path": "/knowledge-base/responses/interpretability/",
"similarity": 18
},
{
"id": "deep-learning-era",
"title": "Deep Learning Revolution (2012-2020)",
"path": "/knowledge-base/history/deep-learning-era/",
"similarity": 17
},
{
"id": "anthropic",
"title": "Anthropic",
"path": "/knowledge-base/organizations/anthropic/",
"similarity": 17
},
{
"id": "goodfire",
"title": "Goodfire",
"path": "/knowledge-base/organizations/goodfire/",
"similarity": 17
}
]
},
"changeHistory": [
{
"date": "2026-02-23",
"branch": "feat/batch-improve-high-risk-pages",
"title": "Auto-improve (standard): Chris Olah",
"summary": "Improved \"Chris Olah\" via standard pipeline (1357.1s). Quality score: 74. Issues resolved: Frontmatter contains malformed XML-like tags in 'llmSummary'; Overview section contains duplicate 'name' attribute on Enti; Co-founding Anthropic section also contains duplicate 'name'.",
"duration": "1357.1s",
"cost": "$5-8"
},
{
"date": "2026-02-17",
"branch": "claude/review-wiki-editing-scCul",
"title": "Wiki editing system refactoring",
"summary": "Six refactors to the wiki editing pipeline: (1) extracted shared regex patterns to `crux/lib/patterns.ts`, (2) refactored validation in page-improver to use in-process engine calls instead of subprocess spawning, (3) split the 694-line `phases.ts` into 7 individual phase modules under `phases/`, (4) created shared LLM abstraction `crux/lib/llm.ts` unifying duplicated streaming/retry/tool-loop code, (5) added Zod schemas for LLM JSON response validation, (6) decomposed 820-line mermaid validation into `crux/lib/mermaid-checks.ts` (604 lines) + slim orchestrator (281 lines). Follow-up review integrated patterns.ts across 19+ files, fixed dead imports, corrected ToolHandler type, wired mdx-utils.ts to use shared patterns, replaced hardcoded model strings with MODELS constants, replaced `new Anthropic()` with `createLlmClient()`, replaced inline `extractText` implementations with shared `extractText()` from llm.ts, integrated `MARKDOWN_LINK_RE` into link validators, added `objectivityIssues` to the `AnalysisResult` type (removing an unsafe cast in utils.ts), fixed CI failure from eager client creation, and tested the full pipeline by improving 3 wiki pages. After manual review of 3 improved pages, fixed 8 systematic pipeline issues: (1) added content preservation instructions to prevent polish-tier content loss, (2) made auto-grading default after --apply, (3) added polish-tier citation suppression to prevent fabricated citations, (4) added Quick Assessment table requirement for person pages, (5) added required Overview section enforcement, (6) added section deduplication and content repetition checks to review phase, (7) added bare URL→markdown link conversion instruction, (8) extended biographical claim checker to catch publication/co-authorship and citation count claims.\n\nSubsequent iterative testing and prompt refinement: ran pipeline on jan-leike, chris-olah, far-ai pages. Discovered and fixed: (a) `<!-- NEEDS CITATION -->` HTML comments break MDX compilation (changed to `{/* NEEDS CITATION */}`), (b) excessive citation markers at polish tier — added instruction to only mark NEW claims (max 3-5 per page), (c) editorial meta-comments cluttering output — added no-meta-comments instruction, (d) thin padding sections — added anti-padding instruction, (e) section deduplication needed stronger emphasis — added merge instruction with common patterns. Final test results: jan-leike 1254→1997 words, chris-olah 1187→1687 words, far-ai 1519→2783 words, miri-era 2678→4338 words; all MDX compile, zero critical issues.",
"pr": 184
}
],
"coverage": {
"passing": 7,
"total": 13,
"targets": {
"tables": 14,
"diagrams": 1,
"internalLinks": 29,
"externalLinks": 18,
"footnotes": 11,
"references": 11
},
"actuals": {
"tables": 1,
"diagrams": 0,
"internalLinks": 52,
"externalLinks": 13,
"footnotes": 29,
"references": 8,
"quotesWithQuotes": 0,
"quotesTotal": 0,
"accuracyChecked": 0,
"accuracyTotal": 0
},
"items": {
"summary": "green",
"schedule": "green",
"entity": "green",
"editHistory": "green",
"overview": "green",
"tables": "amber",
"diagrams": "red",
"internalLinks": "green",
"externalLinks": "amber",
"footnotes": "green",
"references": "amber",
"quotes": "red",
"accuracy": "red"
},
"editHistoryCount": 2,
"ratingsString": "N:2 R:3.5 A:2 C:5"
},
"readerRank": 93,
"researchRank": 348,
"recommendedScore": 112.4
}External Links
{
"grokipedia": "https://grokipedia.com/page/Chris_Olah"
}Backlinks (24)
| id | title | type | relationship |
|---|---|---|---|
| anthropic | Anthropic | organization | research |
| anthropic | Anthropic | organization | — |
| goodfire | Goodfire | organization | — |
| connor-leahy | Connor Leahy | person | — |
| interpretability | Interpretability | research-area | research |
| mech-interp | Mechanistic Interpretability | research-area | research |
| accident-risks | AI Accident Risk Cruxes | crux | — |
| interpretability-sufficient | Is Interpretability Sufficient for Safety? | crux | — |
| deep-learning-era | Deep Learning Revolution (2012-2020) | historical | — |
| anthropic-impact | Anthropic Impact Assessment Model | analysis | — |
| worldview-intervention-mapping | Worldview-Intervention Mapping | analysis | — |
| anthropic-investors | Anthropic (Funder) | analysis | — |
| anthropic-stakeholders | Anthropic Stakeholders | table | — |
| anthropic-valuation | Anthropic Valuation Analysis | analysis | — |
| frontier-ai-comparison | Frontier AI Company Comparison (2026) | concept | — |
| daniela-amodei | Daniela Amodei | person | — |
| dario-amodei | Dario Amodei | person | — |
| evan-hubinger | Evan Hubinger | person | — |
| __index__/knowledge-base/people | People | concept | — |
| neel-nanda | Neel Nanda | person | — |
| alignment | AI Alignment | approach | — |
| anthropic-core-views | Anthropic Core Views | safety-agenda | — |
| corporate-influence | Corporate Influence on AI Policy | crux | — |
| mesa-optimization | Mesa-Optimization | risk | — |