International AI Safety Summit Series
Three international AI safety summits (2023-2025) achieved the first formal recognition of catastrophic AI risks by 28+ countries, established 10+ AI Safety Institutes with combined annual budgets of $100-400M, and secured voluntary commitments from 16 companies covering roughly 80% of frontier AI development. However, all commitments remain non-binding with no enforcement mechanisms, the coalition is fracturing (the US and UK refused to sign the Paris 2025 declaration), and the estimated probability of binding frameworks by 2030 is only 15-30%.
Quick Assessment
| Dimension | Assessment | Evidence |
|---|---|---|
| Enforcement Mechanism | None—entirely voluntary | All declarations and commitments are non-binding; no penalties for non-compliance |
| Participation Rate | High initial, declining alignment | 28 countries at Bletchley → US/UK refused Paris declaration; China abstained at Seoul |
| Institutional Output | 10+ AI Safety Institutes established | Combined budgets of $100-400M annually; UK AISI leads with ≈$65M/year |
| Corporate Engagement | 16-20 frontier AI companies committed | Covers approximately 80% of global frontier AI development capacity |
| Framework Publication Rate | 60% of signatories | 12 of 20 companies published safety frameworks by December 2025 |
| Governance Speed vs AI Development | Mismatched | Summit cycle: 6-12 months; AI capability advances: continuous |
| Probability of Binding Framework by 2030 | 15-30% | Historical precedent: NPT took 20 years; Montreal Protocol took 2 years |
Summit Series Overview
| Summit | Date | Location | Participants | Key Outcome | Declaration Signatories |
|---|---|---|---|---|---|
| Bletchley Park | Nov 1-2, 2023 | UK | 100+ delegates | Bletchley Declaration on AI risks | 28 countries + EU |
| Seoul | May 21-22, 2024 | South Korea | Lower attendance | Frontier AI Safety Commitments | 27 countries + EU |
| Paris | Feb 10-11, 2025 | France | 1,000+ stakeholders | Statement on Inclusive AI | 61 countries (excl. US, UK) |
| Delhi (planned) | Feb 2026 | India | Projected 1,000+ | AI Impact focus | Projected 30+ countries |
Comprehensive Overview
The International AI Safety Summit series represents humanity's first sustained diplomatic effort to coordinate global governance of advanced artificial intelligence systems that could pose catastrophic or existential risks. Beginning with the Bletchley Park Summit in November 2023, these convenings have brought together 28+ countries and the European Union—including major AI powers the United States, China, and the United Kingdom—alongside leading AI companies, researchers, and civil society organizations to establish international dialogue on AI safety.
These summits matter profoundly for AI safety because they represent the primary mechanism through which governments are attempting to coordinate responses to potentially catastrophic AI risks. Unlike previous technology governance efforts that emerged after harms materialized, the summits represent an unprecedented attempt at proactive international cooperation on a rapidly advancing technology with a global market estimated at $184 billion in 2024 and projected to reach $826 billion by 2030. They have produced several key outcomes: formal recognition by major powers that AI poses "potential for serious, even catastrophic harm"; establishment of 10+ AI Safety Institutes with combined annual budgets of $100-400 million (UK: ≈$65M; US: ≈$10M; others: ≈$5-15M each); voluntary commitments from 16-20 major AI companies covering approximately 80% of frontier AI development capacity; and ongoing diplomatic channels for coordination despite geopolitical tensions.
However, the summits face fundamental limitations that constrain their immediate impact on existential risk reduction. All commitments remain voluntary and non-binding, with no enforcement mechanisms or penalties for non-compliance—a concern highlighted by the Carnegie Endowment's analysis characterizing the process as "summit pageantry" rather than substantive progress. Geopolitical tensions, particularly between the US and China, limit the depth of cooperation possible. The summits must balance speed of technological development—where AI capabilities advance weekly—against the inherently slow pace of international diplomacy that operates on annual cycles. Current assessments suggest the summits contribute an estimated 15-30% toward eventual binding international frameworks, serving as necessary but insufficient steps toward comprehensive AI governance.
The Bletchley Park Breakthrough (November 2023)
The inaugural AI Safety Summit at Bletchley Park from November 1-2, 2023, marked a watershed moment in international AI governance. According to the UK Government summary, approximately 150 representatives from national governments, industry, academia, and civil society attended—including US Vice President Kamala Harris and European Commission President Ursula von der Leyen. Hosted by the UK government at the historic World War II codebreaking center, the summit achieved something unprecedented: getting 28 countries, including both the United States and China, to formally acknowledge that advanced AI poses catastrophic risks requiring international cooperation.
The summit's core achievement was the Bletchley Declaration, signed by all 28 participating countries plus the EU, which contained the first formal international recognition that frontier AI models pose "potential for serious, even catastrophic, harm, either deliberate or unintentional." Signatories included the US, UK, China, France, Germany, India, Japan, Australia, Singapore, UAE, Brazil, Canada, Saudi Arabia, and the Republic of Korea. This language represented a diplomatic breakthrough, as it required countries with vastly different political systems and values to agree on the reality of AI catastrophic risks. The declaration specifically identified risks including cybersecurity threats, biotechnology misuse, widespread misinformation, and crucially, "loss of control of AI systems"—the first time loss of control scenarios received formal international acknowledgment.
Beyond symbolic recognition, Bletchley produced concrete institutional commitments. The UK announced creation of its AI Safety Institute with an initial £100 million investment in the Frontier AI Taskforce—the largest funding for AI safety by any country at that time. The United States committed to establishing its own AI Safety Institute, formalized shortly after in President Biden's October 30, 2023 Executive Order. Most significantly for the AI industry, major companies including OpenAI, Google DeepMind, Anthropic, Microsoft, and Meta pledged to provide pre-deployment access to their most advanced models for safety testing by government institutes—marking the first formal agreement for government oversight of frontier AI development.
Seoul's Operational Framework (May 2024)
The Seoul AI Safety Summit on May 21-22, 2024, represented the crucial transition from principles to operational frameworks. Co-hosted by South Korea and the UK, Seoul expanded participation while focusing specifically on frontier AI systems—defined as models requiring over 10^26 floating-point operations for training. This technical specificity reflected growing government sophistication in understanding AI capabilities and risks, moving beyond general statements to precise technological parameters.
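The 10^26 FLOP threshold can be made concrete with the widely used rule of thumb that training compute is roughly 6 × parameters × training tokens. The sketch below is illustrative only: the 6ND approximation and the model sizes are assumptions of this example, not figures from the summit texts.

```python
def training_flops(params: float, tokens: float) -> float:
    """Approximate training compute via the common 6 * N * D rule of thumb."""
    return 6 * params * tokens

FRONTIER_THRESHOLD = 1e26  # Seoul's frontier-model cutoff (FLOPs)

# Hypothetical model: 1 trillion parameters trained on 20 trillion tokens.
compute = training_flops(1e12, 20e12)
print(f"{compute:.1e}")              # 1.2e+26
print(compute > FRONTIER_THRESHOLD)  # True
```

On this approximation, a trillion-parameter model trained on tens of trillions of tokens sits right around the frontier cutoff, which is why the threshold captures only a handful of current labs.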
Seoul's headline achievement was securing Frontier AI Safety Commitments from 16 leading AI companies. These commitments went beyond Bletchley's access provisions to establish ongoing accountability frameworks. Companies pledged to "invest proportionate resources in AI safety research," "work with governments, civil society, and other stakeholders," and crucially, "publish frameworks detailing how they will approach frontier AI safety." In the extreme case, signatories committed not to develop or deploy a model at all if mitigations cannot keep risks below defined thresholds.
Frontier AI Safety Commitments Signatories
| Company | Headquarters | Published Safety Framework |
|---|---|---|
| Amazon | US | Yes |
| Anthropic | US | Yes (RSP) |
| Cohere | Canada | Yes |
| G42 | UAE | Yes |
| Google DeepMind | US/UK | Yes |
| IBM | US | No |
| Inflection AI | US | No |
| Meta | US | Yes |
| Microsoft | US | Yes |
| Mistral AI | France | No |
| Naver | South Korea | Yes |
| NVIDIA | US | Yes (joined later) |
| OpenAI | US | Yes |
| Samsung Electronics | South Korea | No |
| Technology Innovation Institute | UAE | No |
| xAI | US | Yes |
| Zhipu.ai | China | No |
The summit also launched the International AI Safety Research Network, connecting AI Safety Institutes globally to share evaluation methodologies, coordinate pre-deployment testing, and develop common technical standards. This network represents the operational backbone of international AI safety coordination, enabling real-time information sharing about emerging capabilities and risks. Ten countries and the EU pledged to establish AI Safety Institutes. Japan, South Korea, Singapore, France, and Canada launched institutes, while the EU announced its newly established AI Office would fulfill that role.
Seoul demonstrated the evolution of international AI governance from high-level diplomacy to technical cooperation. Working groups emerged focused on concrete challenges: developing standardized evaluation protocols for dangerous capabilities, establishing information-sharing frameworks that protect commercial interests while enabling safety coordination, and creating mechanisms for rapid response if concerning capabilities emerge. Notably, while attending the summit, China abstained from the Seoul Declaration commitments, signaling early divergence in participation.
Paris AI Action Summit: Divergence Emerges (February 2025)
The AI Action Summit in Paris on February 10-11, 2025, co-chaired by French President Emmanuel Macron and Indian Prime Minister Narendra Modi, marked a significant shift in the summit process. Renamed from "AI Safety Summit" to "AI Action Summit," the gathering brought together over 1,000 participants from more than 100 countries. However, the summit revealed growing fractures in international AI governance alignment.
The central outcome was the Statement on Inclusive and Sustainable Artificial Intelligence for People and the Planet, signed by 61 countries including France, China, and India. The statement outlined principles including accessibility, ethical and trustworthy AI development, avoiding market concentration, positive labor market outcomes, environmental sustainability, and international cooperation. Critically, the United States and United Kingdom refused to sign—the first time either country declined such global AI statements since the summit series began, marking what the European Policy Centre termed a potential "Au revoir" to global AI safety coordination.
US Vice President J.D. Vance addressed world leaders stating, "This administration will ensure that American AI technology continues to be the gold standard worldwide," while denouncing regulation that could hamper growth. Vance made clear the Trump Administration "cannot and will not" accept foreign governments "tightening the screws on US tech companies." The American delegation specifically objected to references to AI existential risk, environmental impact, and a role for the UN.
Paris Summit Key Initiatives
| Initiative | Description | Funding/Scale |
|---|---|---|
| Current AI Foundation | Support for AI "public goods" including datasets and open-source tools | $400M initial investment |
| Coalition for Sustainable AI | Environmental impact partnership led by France, UNEP, and ITU | 11 countries, 5 intl. orgs, 37 companies |
| International AI Safety Report | Expert report on AI capabilities and risks | 96 experts from 30 countries |
| French Investment Commitments | Total AI investment secured | €109B over coming years |
The Current AI foundation is backed by ten governments (Finland, France, Germany, Chile, India, Kenya, Morocco, Nigeria, Slovenia, Switzerland) plus philanthropies and companies including Google and Salesforce. China announced establishment of a new body as its answer to AI safety institute calls, though functioning differently from US and UK models. India was announced as host of the next summit—the AI Impact Summit—in February 2026.
Institutional Infrastructure and AI Safety Institutes
The summit series has catalyzed unprecedented institutional capacity building for AI safety governance. The UK AI Safety Institute (renamed AI Security Institute in February 2025), operational since late 2023, has conducted pre-deployment evaluations of frontier models including OpenAI's o1 and Anthropic's Claude 3.5 Sonnet. In May 2024, AISI released the open-source Inspect AI Framework for standardized safety evaluations.
Global AI Safety Institutes Comparison
| Country | Institute | Established | Annual Budget | Staff Size | Key Focus |
|---|---|---|---|---|---|
| United Kingdom | AI Security Institute (AISI) | Nov 2023 | ≈$65M | 100+ | Pre-deployment testing, technical standards |
| United States | CAISI (formerly AISI) | Nov 2023 | ≈$10M | 30-50 | Standards development, innovation promotion |
| Japan | AI Safety Institute | Feb 2024 | ≈$10M | 20-30 | Safety guidelines, international coordination |
| South Korea | AI Safety Institute | May 2024 | ≈$10M | 20-30 | Frontier model evaluation |
| Singapore | AI Verify Foundation | 2022 | ≈$5M | 15-20 | Testing toolkit, governance framework |
| France | Within AI Office | 2024 | ≈$8M | 15-25 | EU AI Act implementation |
| Canada | AI Safety Institute | 2024 | ≈$10M | 15-25 | Research coordination |
| EU | AI Office | 2024 | ≈$15M | 50+ | AI Act enforcement, GPAI oversight |
Note: The UK's budget is approximately 6-10x larger than other national AISIs, reflecting its early mover advantage and government prioritization. US funding requests faced congressional cuts despite initial Biden administration proposals.
UK AISI Frontier AI Trends Report (December 2025)
The UK AISI's inaugural evidence-based assessment revealed rapid capability advancement:
| Capability Area | 2023 Performance | 2025 Performance | Trend |
|---|---|---|---|
| Cyber (apprentice-level tasks) | ≈9% success | ≈50% success | Doubling every 8 months |
| Cyber (expert-level tasks) | 0% | First completion in 2025 | New threshold crossed |
| Software engineering (hour-long tasks) | <5% completion | >40% completion | 8x improvement |
| Biology/chemistry knowledge | Below PhD-level | Outperforming PhD researchers | Exceeds human expert |
| Safeguard robustness | Baseline | 40x variation in jailbreak effort | Improving but variable |
Key finding: "The duration of some cyber tasks that AI systems can complete without human direction is roughly doubling every eight months." However, vulnerabilities have been found in every system tested.
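Assuming exponential growth between two observations, the implied doubling time follows from a one-line formula. Note the report's 8-month headline refers to task duration; the success-rate numbers from the table above give only a loose cross-check, and the 24-month gap between observations is an assumption of this sketch.

```python
import math

def doubling_time(months: float, start: float, end: float) -> float:
    """Implied doubling time assuming exponential growth between two observations."""
    return months * math.log(2) / math.log(end / start)

# Apprentice-level cyber success: ~9% (2023) -> ~50% (2025), roughly 24 months apart.
print(round(doubling_time(24, 0.09, 0.50), 1))  # 9.7
```

The resulting ~9.7-month doubling time is in the same ballpark as the report's 8-month figure, consistent with rapid but not perfectly uniform capability growth.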
The US AI Safety Institute, established within NIST, initially received only $10 million in March 2024—approximately one-tenth of the UK's budget despite the US hosting the world's leading AI companies. According to TIME Magazine's analysis, the UK body has "around 10 times the budget of the U.S. government's own AI Safety Institute." The Biden administration requested a $47.7 million increase for FY2025, but congressional appropriators cut funding further. In 2025, the US AI Safety Institute was renamed the Center for AI Standards and Innovation (CAISI), with its mission transformed under the Trump administration.
Beyond these flagship institutes, the summit process has spurred institutional development across multiple countries. The EU AI Office, while primarily focused on implementing the AI Act, has expanded its mandate to include frontier AI evaluation. Japan, South Korea, Singapore, France, and Canada have launched AI Safety Institutes. Most established AISIs report similar funding and staffing levels, with annual budgets generally around $10 million; the UK is the notable exception at roughly $65 million.
This institutional infrastructure represents the most tangible outcome of the summit process. Unlike voluntary company commitments or non-binding declarations, AI Safety Institutes create permanent government capacity to understand, evaluate, and potentially regulate advanced AI systems. Their technical work provides the foundation for evidence-based policy making and establishes precedents for how governments can maintain oversight of rapidly advancing AI capabilities.
Summit Process Evolution
```mermaid
flowchart TD
    subgraph BLETCHLEY["Bletchley Park (Nov 2023)"]
        B1[28 Countries + EU Sign Declaration]
        B2[First Recognition of Catastrophic Risk]
        B3[UK/US Announce AI Safety Institutes]
    end
    subgraph SEOUL["Seoul (May 2024)"]
        S1[16 Companies Sign Safety Commitments]
        S2[International AISI Network Launched]
        S3[China Abstains from Declaration]
    end
    subgraph PARIS["Paris (Feb 2025)"]
        P1[61 Countries Sign Statement]
        P2[US and UK Refuse to Sign]
        P3[$400M Current AI Foundation]
    end
    subgraph DELHI["Delhi (Feb 2026)"]
        D1[AI Impact Summit]
        D2[Global South Focus]
    end
    BLETCHLEY --> SEOUL
    SEOUL --> PARIS
    PARIS --> DELHI
    style B1 fill:#90EE90
    style S1 fill:#90EE90
    style P2 fill:#FFB6C1
    style S3 fill:#FFD700
```
The diagram illustrates the evolution from broad consensus at Bletchley to emerging divergence at Paris, with China abstaining at Seoul and the US/UK refusing to sign at Paris, representing a potential fracturing of the initial coalition.
Binding vs Non-Binding Outcomes Comparison
| Summit | Binding Commitments | Non-Binding Commitments | Enforcement Mechanism |
|---|---|---|---|
| Bletchley 2023 | 0 | Declaration (28 countries), AISI announcements | None |
| Seoul 2024 | 0 | Declaration (27 countries), Frontier Safety Commitments (16 companies) | None; relies on reputational pressure |
| Paris 2025 | 0 | Statement (61 countries—excluding US/UK), Current AI Foundation ($400M) | None |
| EU AI Act (for comparison) | Full legal framework | N/A | Fines up to €35M or 7% of global turnover |
Key insight: After three summits spanning 15 months and involving 100+ countries, zero binding commitments have been achieved. The only binding AI governance framework remains the EU AI Act, which was developed independently of the summit process.
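The EU AI Act's "higher of" penalty rule cited in the table above can be sketched in a few lines. This is an illustration of the arithmetic only, not legal guidance, and the example turnover figures are hypothetical.

```python
def max_ai_act_fine(global_turnover_eur: float) -> float:
    """Cap on fines for the most serious AI Act violations:
    the greater of EUR 35 million or 7% of global annual turnover."""
    return max(35_000_000, 0.07 * global_turnover_eur)

# Large firm (EUR 10B turnover): the 7% prong dominates.
print(f"{max_ai_act_fine(10e9):,.0f}")   # 700,000,000
# Small firm (EUR 100M turnover): the flat EUR 35M cap dominates.
print(f"{max_ai_act_fine(100e6):,.0f}")  # 35,000,000
```

The two-prong structure ensures the cap scales with large firms while remaining meaningful for small ones—a sharp contrast with the summit commitments, which attach no financial consequence at any scale.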
Industry Engagement and Corporate Commitments
The summit series has achieved unprecedented industry participation in international AI safety governance. The Seoul commitments from 16 major companies (later expanded to 20) represent roughly 80% of global frontier AI development capacity, including OpenAI (GPT family), Google DeepMind (Gemini), Anthropic (Claude), Microsoft, Meta, Amazon, and emerging players like xAI and Mistral. This broad industry engagement reflects both growing corporate recognition of AI risks and strategic calculations about the benefits of cooperative governance versus adversarial regulation.
Corporate commitments fall into several categories with varying implementation prospects. Transparency commitments appear most likely to be honored, as companies have begun publishing detailed safety frameworks and capability assessments. OpenAI's "Preparedness Framework," Anthropic's "Responsible Scaling Policy," and Google DeepMind's "Frontier Safety Framework" all emerged partially from summit discussions and represent substantive technical documents outlining safety practices. These publications enable external evaluation of company practices and create reputational pressure for follow-through.
Pre-deployment testing agreements represent the most operationally significant commitments, granting AI Safety Institutes access to evaluate models before public release. Early evidence suggests mixed compliance: the UK AI Safety Institute reports conducting evaluations of several frontier models, while some companies have delayed providing access to their most advanced systems. The challenge lies in balancing legitimate commercial confidentiality with meaningful safety evaluation, particularly as model evaluation requires understanding not just capabilities but training processes and intended deployment strategies.
Investment commitments in safety research remain difficult to verify without detailed financial disclosure. Industry spending on AI safety has increased substantially, with companies reporting 5-15% of AI research budgets dedicated to safety work—though these figures are self-reported and definitions of "safety work" vary significantly. For context, frontier AI labs spend an estimated $1-10 billion annually on AI research and development, implying safety investments of roughly $50 million to $1.5 billion across major labs, comparable to or larger than the entire government AISI ecosystem. The summit process has established expectations for safety investment but lacks enforcement mechanisms beyond reputational pressure and potential future regulatory requirements.
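Multiplying the corners of the two self-reported ranges shows why verification matters: the implied per-lab interval spans more than an order of magnitude. This is a back-of-envelope sketch using only the percentage and budget figures stated above.

```python
# Self-reported inputs: 5-15% of an estimated $1-10B annual
# AI R&D budget per frontier lab.
share_low, share_high = 0.05, 0.15
rnd_low, rnd_high = 1e9, 10e9

safety_low = share_low * rnd_low     # most conservative corner of the interval
safety_high = share_high * rnd_high  # most generous corner

print(f"${safety_low / 1e6:.0f}M to ${safety_high / 1e9:.1f}B")  # $50M to $1.5B
```

A 30x spread between the corners means self-reported "proportionate" investment is effectively unfalsifiable without audited disclosure.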
Geopolitical Dynamics: US-China-EU Triangulation
The summit series has revealed complex and shifting geopolitical dynamics, with different powers adopting varied positions across summits:
| Country/Bloc | Bletchley 2023 | Seoul 2024 | Paris 2025 | Current Stance |
|---|---|---|---|---|
| United States | Signed declaration | Signed; supported commitments | Refused to sign | Opposing regulation; renamed AISI |
| United Kingdom | Host; signed | Co-host; signed | Refused to sign | Renamed AISI to "AI Security Institute" |
| China | Signed declaration | Abstained from Seoul Declaration | Signed Paris statement | Established own AI safety body |
| EU | Signed | Signed; AI Office joined network | Signed | Strong regulatory approach (AI Act) |
| France | Signed | Signed | Host; signed | €109B investment commitment |
| India | Signed | Signed | Co-chair; signed | Hosting 2026 summit |
China's participation trajectory is particularly notable. Chinese Vice Premier Zhang Guoqing attended Paris, and China signed the inclusive AI statement that the US and UK refused. China announced establishment of a new body as its answer to AI safety institute calls, though functioning differently from Western models. The success of DeepSeek demonstrated that smaller AI outfits can drive innovation, reinforcing China's position that market concentration concerns are legitimate.
The US position shifted dramatically with the Trump administration. Vice President Vance's Paris statement that the administration "cannot and will not" accept foreign governments "tightening the screws on US tech companies" signals a fundamental reorientation away from multilateral AI governance. The American delegation specifically objected to references to AI existential risk, environmental impact, and UN involvement.
Despite these tensions, some cooperation channels remain. The UK and US AI Safety Institutes conducted joint pre-deployment evaluations of OpenAI's o1 and Anthropic's Claude 3.5 Sonnet in late 2024. Technical working groups continue identifying common ground on evaluation methodologies. However, the Paris divergence suggests potential bifurcation into competing governance frameworks—one led by the US emphasizing innovation and market freedom, another involving the EU, China, and Global South emphasizing regulation and equity.
Current State and Near-Term Trajectory (2025-2026)
Following the Paris summit in February 2025, the summit process has entered a new phase characterized by divergent national approaches. The renaming from "AI Safety Summit" to "AI Action Summit" and the US/UK refusal to sign the Paris declaration signals a potential fracturing of the initial coalition. India will host the next summit—the AI Impact Summit—in February 2026, likely further broadening focus beyond pure safety concerns.
The institutional infrastructure created by the summits is reaching operational maturity but facing political headwinds. The UK AISI's Frontier AI Trends Report (December 2025) provides the most comprehensive government assessment of AI capabilities to date. However, the US AI Safety Institute's transformation into the Center for AI Standards and Innovation (CAISI) under the Trump administration represents a significant shift in US government approach, with reduced emphasis on safety evaluation and increased focus on promoting US AI competitiveness.
Corporate commitments are showing mixed implementation. Twelve of 20 signatory companies have published frontier AI safety policies (as of December 2025): Anthropic, OpenAI, Google DeepMind, Magic, Naver, Meta, G42, Cohere, Microsoft, Amazon, xAI, and NVIDIA. Pre-deployment testing agreements face practical challenges, but the UK/US joint evaluations of Claude 3.5 Sonnet and o1 demonstrate that government-industry cooperation on safety evaluation remains possible despite political shifts.
The most significant development is the emerging governance bifurcation. The US is moving toward a deregulatory approach emphasizing AI competitiveness. The EU continues implementing the AI Act with strong regulatory requirements. China is developing its own governance framework emphasizing AI sovereignty. This fragmentation may lead to a "Brussels effect" dynamic where EU standards shape global practices, or alternatively to regulatory arbitrage where AI development concentrates in the least-regulated jurisdictions.
Medium-Term Prospects and Binding Frameworks (2025-2030)
The critical question for the summit process is whether voluntary commitments can evolve into binding international agreements capable of meaningfully constraining AI development if catastrophic risks materialize. According to Lawfare's analysis, this transition is possible but difficult, typically requiring 5-15 years of relationship building, technical development, and crisis motivation. Historical precedents suggest varied timelines: the Nuclear Non-Proliferation Treaty required two decades of preliminary efforts; the Montreal Protocol addressing ozone depletion took only two years but addressed a simpler technical problem with clear scientific consensus.
Several factors could accelerate movement toward binding frameworks. A near-miss AI incident—perhaps an AI system causing significant economic damage or demonstrating unexpectedly dangerous capabilities—could provide crisis motivation for stronger international action. Technical advances in AI evaluation and verification could address current challenges in monitoring compliance with safety requirements. Generational changes in government leadership might reduce resistance to international constraints on AI development.
Conversely, several factors could prevent binding agreements. Intensifying geopolitical competition between the US and China could make meaningful cooperation impossible. Rapid AI advancement could outpace diplomatic processes, making governance frameworks obsolete before implementation. Commercial resistance to binding constraints could weaken government commitment to international agreements. Absence of clear AI-caused harm could undermine political momentum for stronger action.
The most likely scenario involves gradual strengthening of existing frameworks rather than comprehensive binding treaties. According to the CSIS analysis of the AI Safety Institute International Network, technical cooperation through AI Safety Institutes is likely to deepen, creating shared methodologies and informal coordination mechanisms. Industry commitments may become more specific and verifiable, particularly as governments develop stronger oversight capabilities. Regional initiatives—particularly the EU AI Act (which entered force in 2024 as the world's first comprehensive AI legal framework) and potential US-Canada-UK coordination—could create de facto international standards through market pressure.
Safety Implications and Risk Assessment
From an AI safety perspective, the summit process generates both encouraging and concerning implications for existential risk reduction. On the positive side, the summits have achieved unprecedented international recognition of AI catastrophic risks, established government institutions with technical capacity to evaluate dangerous AI capabilities, and created diplomatic channels for coordination that could prove crucial if severe risks materialize. The process has also demonstrated that international cooperation on AI safety remains possible despite broader geopolitical tensions.
Summit Effectiveness Assessment
| Outcome Category | Achievement Level | Strength | Key Limitation |
|---|---|---|---|
| Risk Recognition | High | First international acknowledgment of "loss of control" risks by 28+ nations | Recognition does not equal action |
| Institutional Capacity | Medium-High | 10+ AISIs with combined $100-400M budgets | Funding disparities; US underfunded relative to AI industry presence |
| Corporate Accountability | Medium | 16+ companies signed commitments; 60% published frameworks | Voluntary; no verification or penalties |
| Information Sharing | Medium | International AISI Network operational; joint UK-US evaluations | Limited to frontier labs with cooperation agreements |
| Enforcement Mechanisms | None | — | No binding requirements; no consequences for non-compliance |
| Speed of Response | Low | Summit cycle: 6-12 months | AI capabilities advance weekly; governance fundamentally outpaced |
| Coalition Durability | Declining | Initial 28-country consensus | US/UK refusing Paris; China selective participation |
However, the limitations are severe. All current commitments remain voluntary and non-binding, providing no constraints on actors willing to develop dangerous AI systems regardless of international pressure. The summit process operates on annual timelines while AI capabilities advance continuously, creating fundamental mismatches between governance speed and technological development. Verification and enforcement mechanisms remain weak, potentially enabling cheating or free-riding on safety commitments.
The risk of false assurance represents a particular concern. Successful summits and industry commitments could create public perception that AI risks are being adequately managed, potentially reducing pressure for stronger action. As the Center for AI Safety Newsletter noted, "voluntary commitments aren't entirely toothless, but they often need to be more robust to ensure meaningful compliance." Similarly, the emphasis on international cooperation could delay necessary unilateral actions by countries or companies concerned about AI risks. The summit process might serve corporate interests in avoiding binding regulation while providing minimal actual constraint on dangerous development. Max Tegmark of MIT and the Future of Life Institute has argued that "self-regulation simply isn't working, and the only solution is legally binding safety standards like we have for medicine, food, and airplanes."
Quantitative risk assessment suggests the summits contribute an estimated 15-30% probability increase toward achieving effective international AI governance by 2030, with wide uncertainty ranges (90% confidence interval: 5-50%). This contribution operates primarily through institutional capacity building (estimated at 40-60% of total impact), norm development (20-30%), and relationship establishment (15-25%) rather than immediate risk reduction. The summits are necessary but insufficient for AI safety, requiring parallel efforts on technical safety research, domestic regulation, and industry safety practices. By comparison, the EU AI Act alone may contribute 20-40% toward effective governance of AI systems deployed within its jurisdiction, though with limited reach outside Europe.
Key Uncertainties and Critical Variables
Several fundamental uncertainties will determine the ultimate success or failure of the international summit process. The durability of US-China cooperation represents perhaps the most important variable, as meaningful AI governance likely requires coordination between the world's two leading AI powers. Current engagement remains fragile and could collapse if geopolitical tensions escalate further or if AI technology becomes perceived as decisive for military or economic competition.
The speed of AI development relative to governance capability presents another critical uncertainty. If AI systems achieve dangerous capabilities much faster than anticipated—potentially through algorithmic breakthroughs rather than just scaling—the current summit timelines may prove inadequate. Conversely, if AI development plateaus or faces technical barriers, the summit process could have more time to develop effective governance frameworks.
The occurrence of warning shots or near-miss events could dramatically accelerate international cooperation or, conversely, trigger competitive dynamics that undermine collaboration. AI systems causing significant harm—whether through accidents, misuse, or unexpected capabilities—could provide the crisis motivation necessary for binding international agreements. However, such events could also trigger national security responses that prioritize competitive advantage over cooperative governance.
Industry behavior and technological concentration represent additional critical variables. The current concentration of advanced AI development among a small number of companies makes governance potentially easier but also creates single points of failure. If AI capabilities become more distributed across many actors or countries, international coordination could become much more difficult. Corporate responses to governance pressure—whether cooperative engagement or adversarial resistance—will significantly influence the summit process's effectiveness.
Technical progress in AI evaluation and safety could determine whether international governance remains feasible as AI systems become more advanced. Current evaluation methodologies struggle to assess the most dangerous potential capabilities, and verification of safety practices remains challenging. Breakthroughs in AI evaluation, interpretability, or alignment could enable much more effective international oversight, while technical barriers could make governance frameworks increasingly ineffective.
The interaction of these uncertainties creates wide probability distributions around potential outcomes. While the summit process has established important foundations for international AI governance, its ultimate impact on existential risk reduction remains highly uncertain and depends on developments largely outside the direct control of summit participants.
Sources and Further Reading
Official Summit Documents
- The Bletchley Declaration - UK Government (November 2023)
- Seoul Declaration for Safe, Innovative and Inclusive AI - UK Government (May 2024)
- Frontier AI Safety Commitments - UK Government (May 2024)
AI Safety Institute Reports
- Frontier AI Trends Report - UK AI Security Institute (December 2025)
- Pre-deployment evaluation of OpenAI's o1 model - UK AISI, jointly with the US AI Safety Institute
- Inspect AI Framework - UK AISI (open-source AI safety testing platform)
Analysis and Commentary
- The AI Seoul Summit - CSIS analysis
- France's AI Action Summit - CSIS analysis (February 2025)
- AI Safety Summits Overview - Future of Life Institute
- Common Elements of Frontier AI Safety Policies - METR (December 2025)
- The Paris Summit: Au Revoir, Global AI Safety? - European Policy Centre
Industry Frameworks
- AI Lab Watch: Company Commitments - tracker of industry safety commitments
- Google DeepMind Partnership with UK AISI - DeepMind blog
References
The Seoul Declaration is an international agreement reached at the AI Seoul Summit on 21 May 2024, building on the Bletchley Park process, in which world leaders committed to safe, innovative, and inclusive AI development. It includes a Statement of Intent toward International Cooperation on AI Safety Science, signaling multilateral commitment to coordinated AI safety research and governance.
CSIS analyst Laura Caroli examines the Paris AI Action Summit (February 10-11, 2025), analyzing how it differed from previous UK and Seoul summits by shifting focus from AI safety to AI adoption, innovation, and investment. The piece explores France's optimistic framing of AI governance under Macron's leadership and its implications for international AI coordination.
The Bletchley Declaration is a landmark multilateral agreement signed by 28 countries at the UK's AI Safety Summit in November 2023, establishing shared recognition of AI's risks and opportunities. It represents the first major international consensus document specifically focused on frontier AI safety, committing signatories to cooperative risk assessment and governance frameworks.
A consensus document from the 2025 Singapore Conference on AI, co-authored by 88 researchers including Bengio, Russell, and Tegmark, organizing AI safety research into a defence-in-depth framework covering Assessment, Development, and Control. It identifies cooperative research priorities where even geopolitical competitors share mutual interest in shared risk thresholds, evaluation protocols, and safety benchmarks. The document builds on the 2025 International AI Safety Report and addresses risks up to and including AGI.
A UK AI Safety Institute government assessment documenting exponential performance improvements across frontier AI systems in multiple domains. The report evaluates emerging capabilities and associated risks, calling for robust safeguards as systems advance rapidly. It serves as an official benchmark of the current frontier AI landscape from a national safety authority.
AI Lab Watch's Commitments Tracker monitors and evaluates the public safety commitments made by major AI laboratories, tracking whether frontier AI companies are honoring pledges related to safety, governance, and responsible deployment. It serves as an accountability tool by systematically documenting what labs have promised and assessing follow-through.
A collection of voluntary safety commitments made by leading AI companies at the AI Seoul Summit 2024, building on the Bletchley Declaration. Companies pledge to publish safety frameworks, conduct pre-deployment evaluations, share safety information, and establish responsible scaling thresholds before deploying frontier AI models.
This Future of Life Institute page provides an overview of international AI Safety Summits, tracking major government-led convenings aimed at coordinating global policy responses to advanced AI risks. It serves as a reference hub for understanding the diplomatic and governance landscape emerging around frontier AI safety.
This EPC analysis examines the 2025 Paris AI Action Summit and assesses whether it represented a retreat from the safety-focused agenda established at earlier summits in Bletchley Park and Seoul. The piece evaluates geopolitical dynamics, notably US and French positioning, and their impact on international AI governance momentum. It raises concerns about whether the global coalition around AI safety is fracturing.
METR analyzes the safety policies of 12 frontier AI companies to identify common elements, commitments, and gaps in how organizations approach responsible deployment of advanced AI systems. The analysis synthesizes patterns across responsible scaling policies, model cards, and safety frameworks to provide a comparative overview of industry norms. It serves as a reference for understanding where consensus exists and where significant variation or absence of commitments remains.
DeepMind announces an expanded collaboration with the UK AI Security Institute (AISI) to advance AI safety research, focusing on evaluations, red-teaming, and safety testing of frontier AI models. The partnership aims to develop shared methodologies and tools for assessing risks from advanced AI systems.
The US and UK AI Safety Institutes jointly conducted a pre-deployment safety evaluation of OpenAI's o1 reasoning model, assessing its capabilities in cyber, biological, and software development domains. The evaluation benchmarked o1 against reference models to identify potential risks before public release. This represents an early example of government-led pre-deployment AI safety testing through formal institute collaboration.
A CSIS analysis of the AI Seoul Summit, examining the outcomes of international AI governance diplomacy as a follow-up to the UK AI Safety Summit at Bletchley Park. The piece covers multilateral commitments, safety frameworks, and geopolitical dynamics shaping global AI governance efforts.
Inspect is an open-source framework developed by the UK AI Safety Institute (AISI) for evaluating large language models and AI systems. It provides standardized tools for running safety evaluations, benchmarks, and red-teaming tasks. The framework enables researchers and developers to assess AI model capabilities and safety properties in a reproducible and extensible way.
The official UK government page for the AI Safety Summit 2023, held November 1-2 at Bletchley Park, which convened governments, AI companies, civil society, and researchers to address frontier AI risks. Key outputs include the Bletchley Declaration—a multilateral agreement on AI safety—company safety policies, and a frontier AI capabilities and risks discussion paper. The summit marked a landmark moment in international AI governance coordination.
Carnegie Endowment analysis examines whether high-profile AI safety summits (like Bletchley Park and Seoul) translate into meaningful governance progress or remain largely ceremonial. The piece evaluates the gap between international AI governance rhetoric and substantive policy coordination, arguing that geopolitical competition risks turning AI governance into a performative arms race rather than genuine risk reduction.
Wikipedia article covering the UK AI Safety Institute (AISI), a government body established in 2023 to advance AI safety research and evaluation. It provides an overview of the institute's mission, structure, key activities such as frontier model evaluations, and its role in international AI safety coordination. The article serves as a reference point for understanding the UK's institutional approach to governing advanced AI.
This CSIS analysis examines the international network of AI Safety Institutes established across multiple countries and provides recommendations for strengthening their coordination, scope, and effectiveness. It addresses how these institutes can better collaborate on technical safety evaluations and policy alignment to address frontier AI risks.
This newsletter issue analyzes the Frontier AI Safety Commitments agreed upon at the Seoul AI Summit, arguing that voluntary RSPs (Responsible Scaling Policies) are insufficient as a primary safety mechanism. It also covers a Senate AI Policy Roadmap and provides an overview of catastrophic AI risks.
The Future of Life Institute's AI Safety Index Summer 2025 systematically evaluates leading AI companies on safety practices, finding widespread deficiencies across risk management, transparency, and existential safety planning. Anthropic receives the highest grade of C+, indicating that even the best-performing company falls significantly short of adequate safety standards. The report serves as a comparative benchmark for industry accountability.