Dan Hendrycks

dan-hendrycks (E89)
Path: /knowledge-base/people/dan-hendrycks/
Page Metadata
{
  "id": "dan-hendrycks",
  "numericId": null,
  "path": "/knowledge-base/people/dan-hendrycks/",
  "filePath": "knowledge-base/people/dan-hendrycks.mdx",
  "title": "Dan Hendrycks",
  "quality": 19,
  "importance": 18,
  "contentFormat": "article",
  "tractability": null,
  "neglectedness": null,
  "uncertainty": null,
  "causalLevel": null,
  "lastUpdated": "2026-01-29",
  "llmSummary": "Biographical overview of Dan Hendrycks, CAIS director who coordinated the May 2023 AI risk statement signed by major AI researchers. Covers his technical work on benchmarks (MMLU, ETHICS), robustness research, and institution-building efforts, emphasizing his focus on catastrophic AI risk as a global priority.",
  "structuredSummary": null,
  "description": "Director of CAIS, focuses on catastrophic AI risk reduction",
  "ratings": {
    "novelty": 1.5,
    "rigor": 2,
    "actionability": 1,
    "completeness": 4
  },
  "category": "people",
  "subcategory": null,
  "clusters": [
    "ai-safety",
    "governance"
  ],
  "metrics": {
    "wordCount": 1279,
    "tableCount": 1,
    "diagramCount": 0,
    "internalLinks": 11,
    "externalLinks": 0,
    "footnoteCount": 0,
    "bulletRatio": 0.64,
    "sectionCount": 33,
    "hasOverview": false,
    "structuralScore": 5
  },
  "suggestedQuality": 33,
  "updateFrequency": null,
  "evergreen": true,
  "wordCount": 1279,
  "unconvertedLinks": [],
  "unconvertedLinkCount": 0,
  "convertedLinkCount": 0,
  "backlinkCount": 2,
  "redundancy": {
    "maxSimilarity": 16,
    "similarPages": [
      {
        "id": "yoshua-bengio",
        "title": "Yoshua Bengio",
        "path": "/knowledge-base/people/yoshua-bengio/",
        "similarity": 16
      },
      {
        "id": "cais",
        "title": "CAIS (Center for AI Safety)",
        "path": "/knowledge-base/organizations/cais/",
        "similarity": 15
      },
      {
        "id": "connor-leahy",
        "title": "Connor Leahy",
        "path": "/knowledge-base/people/connor-leahy/",
        "similarity": 14
      },
      {
        "id": "jan-leike",
        "title": "Jan Leike",
        "path": "/knowledge-base/people/jan-leike/",
        "similarity": 14
      },
      {
        "id": "stuart-russell",
        "title": "Stuart Russell",
        "path": "/knowledge-base/people/stuart-russell/",
        "similarity": 14
      }
    ]
  }
}
Entity Data
{
  "id": "dan-hendrycks",
  "type": "person",
  "title": "Dan Hendrycks",
  "description": "Dan Hendrycks is the Director of the Center for AI Safety (CAIS) and one of the most prolific researchers in AI safety. His work spans technical safety research, benchmark creation, and public advocacy for taking AI risks seriously. He is known for combining rigorous empirical research with clear communication about catastrophic risks.\n\nHendrycks has made foundational contributions to AI safety evaluation. He created MMLU (Massive Multitask Language Understanding), one of the most widely-used benchmarks for measuring AI capabilities, as well as numerous benchmarks for robustness, calibration, and safety. His research on out-of-distribution detection, adversarial robustness, and AI ethics has been highly cited and influenced how the field measures progress.\n\nAs CAIS director, Hendrycks has focused on building the case for AI risk as a serious issue. He was instrumental in organizing the 2023 Statement on AI Risk, signed by hundreds of AI researchers including Turing Award winners, which stated that \"mitigating the risk of extinction from AI should be a global priority.\" His approach emphasizes engaging mainstream ML researchers and policymakers who may not be part of the existing AI safety community.\n",
  "tags": [
    "ai-safety",
    "x-risk",
    "robustness",
    "governance",
    "benchmarks",
    "compute-governance"
  ],
  "relatedEntries": [
    {
      "id": "cais",
      "type": "lab"
    },
    {
      "id": "compute-governance",
      "type": "policy"
    },
    {
      "id": "yoshua-bengio",
      "type": "researcher"
    }
  ],
  "sources": [
    {
      "title": "Dan Hendrycks' Website",
      "url": "https://hendrycks.com"
    },
    {
      "title": "Center for AI Safety",
      "url": "https://safe.ai"
    },
    {
      "title": "Statement on AI Risk",
      "url": "https://safe.ai/statement-on-ai-risk"
    },
    {
      "title": "Google Scholar Profile",
      "url": "https://scholar.google.com/citations?user=VEvOFxQAAAAJ"
    }
  ],
  "lastUpdated": "2025-12",
  "website": "https://hendrycks.com",
  "customFields": []
}
Canonical Facts (0)

No facts for this entity

External Links

No external links

Backlinks (2)
| id | title | type | relationship |
|----|-------|------|--------------|
| far-ai | FAR AI | lab | research |
| maim | MAIM (Mutually Assured AI Malfunction) | policy | |
Frontmatter
{
  "title": "Dan Hendrycks",
  "description": "Director of CAIS, focuses on catastrophic AI risk reduction",
  "sidebar": {
    "order": 15
  },
  "quality": 19,
  "llmSummary": "Biographical overview of Dan Hendrycks, CAIS director who coordinated the May 2023 AI risk statement signed by major AI researchers. Covers his technical work on benchmarks (MMLU, ETHICS), robustness research, and institution-building efforts, emphasizing his focus on catastrophic AI risk as a global priority.",
  "lastEdited": "2026-01-29",
  "importance": 18.5,
  "ratings": {
    "novelty": 1.5,
    "rigor": 2,
    "actionability": 1,
    "completeness": 4
  },
  "clusters": [
    "ai-safety",
    "governance"
  ],
  "entityType": "person"
}
Raw MDX Source
---
title: Dan Hendrycks
description: Director of CAIS, focuses on catastrophic AI risk reduction
sidebar:
  order: 15
quality: 19
llmSummary: Biographical overview of Dan Hendrycks, CAIS director who coordinated the May 2023 AI risk statement signed by major AI researchers. Covers his technical work on benchmarks (MMLU, ETHICS), robustness research, and institution-building efforts, emphasizing his focus on catastrophic AI risk as a global priority.
lastEdited: "2026-01-29"
importance: 18.5
ratings:
  novelty: 1.5
  rigor: 2
  actionability: 1
  completeness: 4
clusters: ["ai-safety","governance"]
entityType: person
---
import {DataInfoBox, DataExternalLinks, EntityLink} from '@components/wiki';

<DataExternalLinks pageId="dan-hendrycks" />

<DataInfoBox entityId="E89" />

## Background

Dan Hendrycks is the director of the <EntityLink id="E47">Center for AI Safety</EntityLink> (CAIS) and a prominent researcher focused on catastrophic and existential risks from AI. He has made significant contributions to both <EntityLink id="E297">technical AI safety research</EntityLink> and public awareness of AI risks.

Career highlights:
- PhD in Computer Science from UC Berkeley
- Post-doc at UC Berkeley
- Founded Center for AI Safety
- Research on robustness, uncertainty, and safety

Hendrycks combines rigorous technical research with effective communication and institution-building to advance AI safety.

## Major Contributions

### Center for AI Safety (CAIS)

Hendrycks founded CAIS as an organization focused on:
- Reducing catastrophic risks from AI
- Technical safety research
- Public awareness and advocacy
- Connecting researchers and resources

**Impact:** CAIS has become a major hub for AI safety work, coordinating research and advocacy.

### Statement on AI Risk (May 2023)

Coordinated the landmark statement: "Mitigating the risk of extinction from AI should be a global priority alongside other societal-scale risks such as pandemics and nuclear war."

**Signatories included:**
- <EntityLink id="E149">Geoffrey Hinton</EntityLink>
- <EntityLink id="E380">Yoshua Bengio</EntityLink>
- <EntityLink id="E269">Sam Altman</EntityLink> (<EntityLink id="E218">OpenAI</EntityLink>)
- <EntityLink id="E101">Demis Hassabis</EntityLink> (DeepMind)
- <EntityLink id="E91">Dario Amodei</EntityLink> (<EntityLink id="E22">Anthropic</EntityLink>)
- Hundreds of AI researchers

**Impact:** Massively raised the profile of AI existential risk and made it a mainstream concern.

### Technical Research

Significant contributions to:

**AI Safety Benchmarks:**
- ETHICS dataset - evaluating moral reasoning
- MMLU (Massive Multitask Language Understanding) - measuring broad knowledge across 57 subjects (see the scoring sketch below)
- Safety-specific evaluation methods
- Adversarial robustness testing
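
As a concrete illustration of how an MMLU-style benchmark is scored: each question has four lettered options and a single answer key, and the headline metric is plain accuracy, averaged per subject. A minimal Python sketch, where `choice_logprob` is a hypothetical stand-in for however a particular model assigns a score to an answer option (real evaluation harnesses add few-shot prompting and more careful normalization):

```python
# Minimal sketch of MMLU-style multiple-choice scoring.
# `choice_logprob(question, choice_text)` is a hypothetical callable that
# returns a model's log-probability (or any score) for an answer option.

def grade_item(question, choices, answer, choice_logprob):
    """Pick the highest-scoring option letter and compare it to the key.

    choices: dict mapping option letters ('A'..'D') to answer text.
    answer:  the correct option letter.
    """
    predicted = max(choices,
                    key=lambda letter: choice_logprob(question, choices[letter]))
    return predicted == answer

def accuracy(items, choice_logprob):
    """MMLU reports plain accuracy over (question, choices, answer) items."""
    correct = sum(grade_item(q, c, a, choice_logprob) for q, c, a in items)
    return correct / len(items)
```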

**Uncertainty and Robustness:**
- Out-of-distribution detection (see the sketch after this list)
- Robustness to distribution shift
- Calibration of neural networks
- Anomaly detection
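
Two of the ideas above are concrete enough to sketch. Hendrycks and Gimpel's widely used 2017 baseline for out-of-distribution detection scores each input by its maximum softmax probability (MSP): inputs the classifier is least confident about are flagged as potentially out-of-distribution. Calibration, meanwhile, is commonly summarized by expected calibration error (ECE). A minimal numpy sketch, assuming `logits` is an (N, K) array of per-class logits; the bin count is illustrative, and detection quality is usually reported with threshold-free metrics such as AUROC:

```python
import numpy as np

def softmax(logits):
    """Row-wise softmax over class logits (numerically stabilized)."""
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def msp_score(logits):
    """Maximum softmax probability: higher suggests 'more in-distribution'."""
    return softmax(logits).max(axis=1)

def expected_calibration_error(logits, labels, n_bins=10):
    """ECE: average |confidence - accuracy| over equal-width confidence bins,
    weighted by the fraction of examples falling in each bin."""
    probs = softmax(logits)
    conf = probs.max(axis=1)
    correct = (probs.argmax(axis=1) == labels).astype(float)
    ece = 0.0
    for lo in np.linspace(0.0, 1.0, n_bins, endpoint=False):
        mask = (conf > lo) & (conf <= lo + 1.0 / n_bins)
        if mask.any():
            ece += mask.mean() * abs(conf[mask].mean() - correct[mask].mean())
    return ece
```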

**Natural Adversarial Examples:**
- Real-world failure modes
- Testing model robustness
- Understanding generalization limits

## Research Philosophy

### Focus on Catastrophic Risk

Hendrycks emphasizes:
- Not just any AI safety issue, but specifically catastrophic and existential risks
- High-stakes scenarios
- Long-term implications

### Empirical and Practical

Approach characterized by:
- Concrete benchmarks and metrics
- Testing on real systems
- Measurable progress
- Actionable results

### Bridging Research and Policy

Works to:
- Make research policy-relevant
- Communicate findings clearly
- Engage with policymakers
- Translate technical work to action

## Views on AI Risk

### Dan Hendrycks' Risk Assessment

Dan Hendrycks has been explicit and consistent about the severity of catastrophic risks from AI, positioning them alongside society's most pressing existential threats. His actions—founding CAIS, coordinating the May 2023 AI risk statement signed by major AI researchers, and maintaining an active research program—demonstrate his belief that technical solutions are both necessary and achievable, though time is of the essence.

| Dimension | Hendrycks' assessment | Reasoning |
|-----------|-----------------------|-----------|
| Catastrophic risk priority | On par with pandemics and nuclear war | Hendrycks coordinated the May 2023 Statement on AI Risk which explicitly positioned extinction risk from AI as a global priority alongside pandemics and nuclear war. This framing was deliberate and endorsed by hundreds of leading AI researchers including Geoffrey Hinton, Yoshua Bengio, and the CEOs of major AI labs. The parallel to other existential risks signals that AI risk deserves similar institutional resources, research funding, and policy attention as these established threats. |
| Need for action | Urgent | Hendrycks founded the Center for AI Safety and coordinated the landmark 2023 statement specifically to accelerate action on catastrophic AI risks. His decision to focus CAIS explicitly on catastrophic and existential risks—rather than broader AI safety concerns—reflects his assessment that these high-stakes scenarios require immediate attention. The timing and prominence of the statement suggest he believes we are in a critical window where preventive measures can still be effective. |
| Technical tractability | Research can reduce risk | CAIS maintains an active research program spanning technical safety research, compute governance, and ML safety education. This investment indicates Hendrycks' belief that concrete technical work—developing robustness measures, creating safety benchmarks, and training the next generation of safety researchers—can meaningfully reduce catastrophic risks. His focus on empirical methods and measurable progress suggests optimism that systematic research can address key problems before advanced AI systems are deployed. |

### Core Concerns

1. **Catastrophic risks are real**: AI poses existential-level threats
2. **Need technical and governance solutions**: Both required
3. **Current systems already show concerning behaviors**: Problems visible now
4. **Rapid capability growth**: Moving faster than safety work
5. **Coordination challenges**: Individual labs can't solve alone

### Strategic Approach

**Multi-pronged:**
- Technical research on safety
- Public awareness and advocacy
- Policy engagement
- Field building and coordination

**Pragmatic:**
- Work with systems as they are
- Focus on measurable improvements
- Build coalitions
- Incremental progress

## CAIS Work

### Research Programs

**Technical Safety:**
- Robustness research
- Evaluation methods
- Alignment techniques
- Empirical studies

**Compute Governance:**
- Hardware-level safety measures
- Compute tracking and allocation
- <EntityLink id="E171">International coordination</EntityLink>
- Supply chain interventions

**ML Safety Course:**
- Educational curriculum
- Training next generation
- Making safety knowledge accessible
- Academic integration

### Advocacy and Communication

**Statement on AI Risk:**
- Coordinated broad consensus
- Brought issue to mainstream
- Influenced policy discussions
- Demonstrated unity in field

**Public Communication:**
- Media appearances
- Op-eds and articles
- Talks and presentations
- Social media engagement

### Field Building

**Connecting Researchers:**
- Workshops and conferences
- Research collaborations
- Funding opportunities
- Community building

## Key Publications

### Safety Benchmarks

- **"ETHICS: Measuring Ethical Reasoning in Language Models"** - Evaluating moral reasoning
- **"Measuring Massive Multitask Language Understanding" (MMLU)** - Comprehensive knowledge benchmark
- **"Natural Adversarial Examples"** - Real-world robustness testing

### Technical Safety

- **"Unsolved Problems in ML Safety"** - Research agenda
- **"Out-of-Distribution Detection"** - Methods for identifying distribution shift
- **"Robustness research"** - Multiple papers on making models more robust

### Position Papers

- **"X-Risk Analysis for AI Research"** - Framework for thinking about catastrophic risks
- **Contributions to policy discussions** - Technical input for governance

## Public Impact

### Raising Awareness

The Statement on AI Risk:
- Reached global media
- Influenced policy discussions
- Made x-risk mainstream
- Built consensus among experts

### Policy Influence

Hendrycks' work has influenced:
- Congressional testimony and hearings
- <EntityLink id="E127">EU AI Act</EntityLink> discussions
- International coordination efforts
- Industry standards

### Academic Integration

CAIS has helped:
- Make safety research academically respectable
- Create curricula and courses
- Train students in safety
- Publish in top venues

## Unique Contributions

### Consensus Building

Exceptional at:
- Bringing together diverse groups
- Finding common ground
- Building coalitions
- Coordinating action

### Communication

Skilled at:
- Explaining technical concepts clearly
- Reaching different audiences
- Media engagement
- Policy translation

### Pragmatic Approach

Focuses on:
- What can actually be done
- Working with current systems
- Measurable progress
- Building bridges

## Current Priorities at CAIS

1. **Technical safety research**: Advancing robustness and alignment
2. **Compute governance**: Hardware-level interventions
3. **Public awareness**: Maintaining pressure on the issue
4. **Policy engagement**: Influencing regulation and governance
5. **Field building**: Growing the safety research community

## Evolution of Focus

**Early research:**
- Robustness and uncertainty
- Benchmarks and evaluation
- Academic ML research

**Growing safety focus:**
- Increasingly concerned about risks
- Founded CAIS
- More explicit about catastrophic risks

**Current:**
- Explicitly focused on x-risk
- Leading advocacy efforts
- Building coalitions
- Policy engagement

## Criticism and Challenges

**Some argue:**
- Focus on catastrophic risk might neglect near-term harms
- Statement was too brief/vague
- Consensus might paper over important disagreements

**Supporters argue:**
- X-risk deserves special focus
- Brief statement was strategically effective
- Consensus demonstrates seriousness of concern

**Hendrycks' approach:**
- X-risk is priority but not only concern
- Brief statement was feature, not bug
- Diversity of views compatible with shared concern

## Vision for the Field

Hendrycks envisions:
- AI safety as central to AI development
- Strong safety standards and regulations
- International coordination on AI
- Technical solutions to catastrophic risks
- Safety research well-funded and respected