Skip to content
Longterm Wiki
Navigation
Updated 2026-02-09HistoryData
Page StatusResponse
Edited 8 weeks ago7 words1 backlinks
57QualityAdequate •52.3ImportanceUseful67ResearchModerate
Content1/13
SummaryScheduleEntityEdit historyOverview
Tables0/ ~1Diagrams0Int. links0/ ~3Ext. links0/ ~1Footnotes0/ ~2References0/ ~1Quotes0Accuracy0RatingsN:4.5 R:6.5 A:3 C:7.5Backlinks1
Issues2
QualityRated 57 but structure suggests 13 (overrated by 44 points)
StructureNo tables or diagrams - consider adding visual content

Natural Abstractions

Concept

Natural Abstractions

The hypothesis that natural abstractions converge across learning processes, aiding alignment

Related
Research Areas
Interpretability
7 words · 1 backlinks

This page is a stub. Content needed.

Related Wiki Pages

Top Related Pages

Approaches

Representation EngineeringSleeper Agent Detection

Risks

Deceptive Alignment

Analysis

Model Organisms of MisalignmentCapability-Alignment Race Model

Safety Research

Anthropic Core Views

Organizations

Anthropic

Key Debates

AI Alignment Research AgendasTechnical AI Safety Research

Other

Leon LangDan Roberts

Concepts

Dense Transformers