Conjecture
Conjecture is a London-based AI safety organization of roughly 30-40 people, founded in 2022 by Connor Leahy (formerly of EleutherAI) and backed by $30M+ in Series A funding. It pursues Cognitive Emulation (CoEm) - building interpretable AI from the ground up rather than aligning existing LLMs - and faces high uncertainty about CoEm's competitiveness over a 3-5 year horizon, along with risks from commercial pressure.
Overview
Conjecture is an AI safety research organization founded in 2022 by Connor Leahy and a team of researchers concerned about existential risks from advanced AI. The organization pursues a distinctive technical approach centered on "Cognitive Emulation" (CoEm) - building interpretable AI systems based on human cognition principles rather than aligning existing large language models.
Based in London with a team of 30-40 researchers, Conjecture raised over $30M in Series A funding in 2023. Its research agenda emphasizes mechanistic interpretability and understanding neural network internals, representing a fundamental alternative to the mainstream prosaic alignment approaches pursued by organizations such as Anthropic and OpenAI.
| Aspect | Assessment | Evidence | Source |
|---|---|---|---|
| Technical Innovation | High | Novel CoEm research agenda | Conjecture Blog |
| Funding Security | Strong | $30M+ Series A (2023) | TechCrunch reports |
| Research Output | Moderate | Selective publication strategy | Conjecture research publications |
| Influence | Growing | European AI policy engagement | UK AISI |
Risk Assessment
| Risk Category | Severity | Likelihood | Timeline | Trend |
|---|---|---|---|---|
| CoEm Uncompetitive | High | Moderate | 3-5 years | Uncertain |
| Commercial Pressure Compromise | Medium | High | 2-3 years | Worsening |
| Research Insularity | Low | Moderate | Ongoing | Stable |
| Funding Sustainability | Medium | Low | 5+ years | Improving |
Founding and Evolution
Origins (2022)
Conjecture emerged from the EleutherAI collective, an open-source AI research group known for building open-source GPT-3-class models (GPT-J, GPT-NeoX). Key founding factors:
| Factor | Impact | Details |
|---|---|---|
| EleutherAI Experience | High | Demonstrated capability replication feasibility |
| Safety Concerns | High | Recognition of risks from capability proliferation |
| European Gap | Medium | Limited AI safety ecosystem outside Bay Area |
| Funding Availability | Medium | Growing investor interest in AI safety |
Philosophical Evolution: The transition from EleutherAI's "democratize AI" mission to Conjecture's safety-focused approach represents a significant shift in thinking about AI development and publication strategies.
Funding Trajectory
| Year | Funding Stage | Amount | Impact |
|---|---|---|---|
| 2022 | Seed | Undisclosed | Initial team of ≈15 researchers |
| 2023 | Series A | $30M+ | Scaled to 30-40 researchers |
| 2024 | Operating | Ongoing | Sustained research operations |
Cognitive Emulation (CoEm) Research Agenda
Core Philosophy
Conjecture's signature approach contrasts sharply with mainstream AI development:
| Approach | Philosophy | Methods | Evaluation |
|---|---|---|---|
| Prosaic Alignment | Train powerful LLMs, align post-hoc | RLHF, Constitutional AI | Behavioral testing |
| Cognitive Emulation | Build interpretable systems from ground up | Human cognition principles | Mechanistic understanding |
Key Research Components
Mechanistic Interpretability
- Circuit discovery in neural networks (a minimal activation-patching sketch follows this list)
- Feature attribution and visualization
- Scaling interpretability to larger models
- Interpretability research collaboration
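To make the circuit-discovery workflow concrete, here is a minimal, hypothetical sketch of activation patching on a toy PyTorch MLP. The model, layer choice, and inputs are invented for demonstration and are not drawn from Conjecture's tooling; the point is only the core logic: cache an activation from a "clean" run, splice it into a "corrupted" run, and check how much of the clean behavior it restores.

```python
# Hypothetical activation-patching sketch; toy model, not Conjecture's code.
import torch
import torch.nn as nn

torch.manual_seed(0)

# A tiny MLP standing in for a real transformer.
model = nn.Sequential(
    nn.Linear(8, 16), nn.ReLU(),
    nn.Linear(16, 16), nn.ReLU(),
    nn.Linear(16, 2),
)
layer = model[2]  # candidate "circuit location" to test (an assumption)

clean_x, corrupt_x = torch.randn(1, 8), torch.randn(1, 8)
cache = {}

def save_act(module, inp, out):
    cache["act"] = out.detach()  # record the clean activation

def patch_act(module, inp, out):
    return cache["act"]  # returning a tensor replaces the layer's output

# 1. Clean run: cache the activation at the chosen layer.
handle = layer.register_forward_hook(save_act)
clean_out = model(clean_x)
handle.remove()

# 2. Corrupted run with the clean activation patched in.
handle = layer.register_forward_hook(patch_act)
patched_out = model(corrupt_x)
handle.remove()

corrupt_out = model(corrupt_x)
print("clean:  ", clean_out)
print("corrupt:", corrupt_out)
print("patched:", patched_out)
# If patching moves the corrupted output back toward the clean output,
# the activation at this layer mediates the behavior under study.
```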
Architecture Design
- Modular systems for better control (illustrated by the pipeline sketch after this list)
- Interpretability-first design choices
- Trading capabilities for understanding
- Novel training methodologies
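Conjecture has not published a reference implementation of a CoEm architecture, so the following is only a speculative sketch of the "interpretability-first" design principle: compose small, auditable steps and log every intermediate result so the system's reasoning can be inspected after the fact. All names and steps here are invented for illustration.

```python
# Speculative illustration of an interpretability-first modular pipeline;
# all components are invented, not Conjecture's actual architecture.
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class AuditableStep:
    name: str
    fn: Callable[[str], str]

@dataclass
class Pipeline:
    steps: list[AuditableStep]
    trace: list[tuple[str, str, str]] = field(default_factory=list)

    def run(self, x: str) -> str:
        for step in self.steps:
            y = step.fn(x)
            self.trace.append((step.name, x, y))  # every step leaves an inspectable record
            x = y
        return x

# Hypothetical steps: each is small enough to verify in isolation.
pipeline = Pipeline(steps=[
    AuditableStep("normalize", str.lower),
    AuditableStep("extract_first_claim", lambda s: s.split(".")[0].strip()),
])

result = pipeline.run("The Sky Is Blue. Trust me.")
for name, inp, out in pipeline.trace:
    print(f"{name}: {inp!r} -> {out!r}")
print("result:", result)
```

The trade-off named in the list above shows up directly here: each step is far weaker than an end-to-end model, but the full trace makes the system's behavior checkable step by step.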
Model Organisms
- Smaller, interpretable test systems
- Alignment property verification
- Deception detection research (see the probe sketch following this list)
- Goal representation analysis
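One common way to operationalize the last two items is a linear probe on internal activations. The sketch below uses synthetic data in place of real model-organism activations, and the labeled property (e.g. deceptive vs. honest behavior) is purely illustrative, not a claim about Conjecture's methods.

```python
# Illustrative linear probe on synthetic "activations"; real model-organism
# data would replace the fake arrays below.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Fake activations: 400 samples x 64 dims. Samples with label 1 are
# shifted along a fixed direction, mimicking a linearly encoded property.
direction = rng.normal(size=64)
labels = rng.integers(0, 2, size=400)
acts = rng.normal(size=(400, 64))
acts[labels == 1] += direction

X_tr, X_te, y_tr, y_te = train_test_split(acts, labels, random_state=0)
probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print(f"held-out probe accuracy: {probe.score(X_te, y_te):.2f}")
# High held-out accuracy is evidence the property (e.g. a goal or a
# deception-related feature) is linearly decodable from activations.
```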
Key Personnel
Leadership Team
Connor Leahy Profile
| Aspect | Details |
|---|---|
| Background | EleutherAI collective member, GPT-J contributor |
| Evolution | From open-source advocacy to safety-focused research |
| Public Role | Active AI policy engagement, podcast appearances |
| Views | Short AI timelines, high P(doom), interpretability seen as necessary for safety |
Timeline Estimates: Leahy has consistently expressed short AI timeline views, suggesting AGI within years rather than decades.
Research Focus Areas
Mechanistic Interpretability
| Research Area | Status | Key Questions |
|---|---|---|
| Circuit Analysis | Active | How do transformers implement reasoning? |
| Feature Extraction | Ongoing | What representations emerge in training? |
| Scaling Methods | Development | Can interpretability scale to AGI-level systems? |
| Goal Detection | Early | How can we detect goal-directedness mechanistically? |
Comparative Advantages
| Organization | Primary Focus | Interpretability Approach |
|---|---|---|
| Conjecture | CoEm, ground-up interpretability | Design-time interpretability |
| Anthropic | Frontier models + interpretability | Post-hoc analysis of LLMs |
| ARC | Theoretical alignment | Evaluation and ELK research |
| Redwood | AI control | Interpretability for control |
Strategic Position
Theory of Change
Conjecture's pathway to AI safety impact:
- Develop scalable interpretability techniques for powerful AI systems
- Demonstrate CoEm's viability as a competitive alternative to black-box scaling
- Influence field direction toward interpretability-first development
- Inform governance with technical feasibility insights
- Build safe systems using CoEm principles if successful
European AI Safety Hub
| Role | Impact | Examples |
|---|---|---|
| Geographic Diversity | High | Alternative to Bay Area concentration |
| Policy Engagement | Growing | UK AISI consultation |
| Talent Development | Moderate | European researcher recruitment |
| Community Building | Early | Workshops and collaborations |
Challenges and Criticisms
Technical Feasibility
| Challenge | Severity | Status |
|---|---|---|
| CoEm Competitiveness | High | Unresolved - early stage |
| Interpretability Scaling | High | Active research question |
| Human Cognition Complexity | Medium | Ongoing investigation |
| Timeline Alignment | High | Critical if AGI timelines short |
Organizational Tensions
Commercial Pressure vs Safety Mission
- VC funding creates return expectations
- Potential future deployment pressure
- Comparison to Anthropic's commercialization path
Publication Strategy Criticism
- Shift from EleutherAI's radical openness
- Selective research sharing decisions
- Balance between transparency and safety
Current Research Outputs
Published Work
| Type | Focus | Impact |
|---|---|---|
| Technical Papers | Interpretability methods | Research community |
| Blog Posts | CoEm explanations | Public understanding |
| Policy Contributions | Technical feasibility | Governance decisions |
| Open Source Tools | Interpretability software | Research ecosystem |
Research Questions
Key Questions
- Can CoEm produce AI systems competitive with scaled LLMs?
- Is mechanistic interpretability sufficient for AGI safety verification?
- How will commercial pressures affect Conjecture's research direction?
- What role should interpretability play in AI governance frameworks?
- Can cognitive emulation bridge neuroscience and AI safety research?
- How does CoEm relate to other alignment approaches like Constitutional AI?
Timeline and Risk Estimates
Leadership Risk Assessments
Conjecture's leadership has articulated clear views on AI timelines and safety approaches, which fundamentally motivate their Cognitive Emulation research agenda and organizational strategy:
| Expert/Source | Estimate | Reasoning |
|---|---|---|
| Connor Leahy | AGI: 2-10 years | Leahy has consistently expressed short AI timeline views across multiple public statements and podcasts from 2023-2024, suggesting transformative AI systems could emerge within years rather than decades. These short timelines create urgency for developing interpretability-first approaches before AGI arrives. |
| Connor Leahy | P(doom): High without major changes | Leahy has expressed significant concern about the default trajectory of AI development in 2023 statements, arguing that prosaic alignment approaches pursued by frontier labs are insufficient to ensure safety. This pessimism about conventional alignment motivates Conjecture's alternative CoEm approach. |
| Conjecture Research | Prosaic alignment: Insufficient | The organization's core research direction reflects a fundamental assessment that post-hoc alignment of large language models through techniques like RLHF and Constitutional AI cannot provide adequate safety guarantees. This view, maintained since founding, drives their pursuit of interpretability-first system design. |
| Organization | Interpretability: Necessary for safety | Conjecture's founding premise holds that mechanistic interpretability is not merely useful but necessary for AI safety verification. This fundamental research assumption distinguishes them from organizations pursuing behavioral safety approaches and shapes their entire technical agenda. |
Future Scenarios
Research Trajectory Projections
| Timeline | Optimistic | Realistic | Pessimistic |
|---|---|---|---|
| 2-3 years | CoEm demonstrations, policy influence | Continued interpretability advances | Commercial pressure compromises |
| 3-5 years | Competitive interpretable systems | Mixed results, partial success | Research agenda stagnates |
| 5+ years | Field adoption of CoEm principles | Portfolio contribution to safety | Marginalized approach |
Critical Dependencies
| Factor | Importance | Uncertainty |
|---|---|---|
| Technical Feasibility | Critical | High - unproven at scale |
| Funding Continuity | High | Medium - VC expectations |
| AGI Timeline | Critical | High - if very short, insufficient time |
| Field Receptivity | Medium | Medium - depends on results |
Relationships and Collaborations
Within AI Safety Ecosystem
| Organization | Relationship | Collaboration Type |
|---|---|---|
| Anthropic | Friendly competition | Interpretability research sharing |
| ARC | Complementary | Different technical approaches |
| MIRI | Aligned concerns | Shared skepticism of prosaic alignment |
| Academic Labs | Collaborative | Interpretability technique development |
Policy and Governance
UK Engagement
- UK AI Safety Institute consultation
- Technical feasibility assessments
- European AI Act discussions
International Influence
- Growing presence in global AI safety discussions
- Alternative perspective to US-dominated discourse
- Technical grounding for governance approaches
Sources & Resources
Primary Sources
| Type | Source | Description |
|---|---|---|
| Official Website | Conjecture.dev | Research updates, team information |
| Research Papers | Google Scholar profile | Technical publications |
| Blog Posts | Conjecture blog (now at conjecture.dev/research) | Research explanations, philosophy |
| Interviews | Connor Leahy talks and podcasts | Leadership perspectives |
Secondary Analysis
| Type | Source | Focus |
|---|---|---|
| AI Safety Analysis | LessWrong Conjecture tag | Community discussion |
| Technical Reviews | Alignment Forum | Research evaluation |
| Policy Reports | GovAI analysis | Governance implications |
| Funding News | TechCrunch coverage | Business developments |
Related Resources
| Topic | Internal Links | External Resources |
|---|---|---|
| Interpretability | Technical Interpretability | Anthropic Transformer Circuits |
| Alignment Approaches | Why Alignment is Hard | AI Alignment Forum |
| European AI Policy | UK AISI | EU AI Office |
| Related Orgs | Safety Organizations | AISafety.info |