Updated 2025-12-28
Corporate Influence on AI Policy


Comprehensive analysis of corporate influence pathways (working inside labs, shareholder activism, whistleblowing) showing mixed effectiveness: safety teams influenced GPT-4's staged release and the adoption of responsible scaling policies, but roughly 50% of OpenAI's AGI safety staff departed in 2024, and the November 2023 board crisis demonstrated that commercial pressures can override safety concerns. Provides specific compensation data ($115K-$190K for researchers), talent-flow metrics (engineers are 8x more likely to leave OpenAI for Anthropic than the reverse), and a detailed assessment that 1,500-2,500 people work in safety roles at frontier labs globally, with roughly 60% in the SF Bay Area.

Category: Direct engagement with AI companies
Time to Impact: Immediate to 3 years
Key Leverage: Inside access and relationships
Risk Level: Medium-High
Counterfactual Complexity: Very High
Related

Organizations: Anthropic · OpenAI · Google DeepMind
Risks: AI Development Racing Dynamics

Overview

Direct corporate influence represents one of the most immediate and controversial approaches to AI safety: working within or pressuring frontier AI labs to make safer decisions about developing and deploying advanced AI systems. Rather than building governance structures or conducting independent research, this approach attempts to shape the behavior of the organizations that are actually building potentially transformative AI systems.

The theory is compelling in its directness—if OpenAI, Anthropic, Google DeepMind, and other frontier labs are the entities closest to developing AGI, then influencing their decisions may be the most direct path to reducing existential risk. This could mean joining their safety teams, using shareholder pressure, exposing dangerous practices through whistleblowing, or advocating for better safety culture from within.

However, this approach involves significant moral complexity. Critics argue that working at frontier labs provides legitimacy and talent to organizations engaged in a dangerous race toward AGI, potentially accelerating risks even when intending to reduce them. The effectiveness depends heavily on whether safety-conscious individuals can meaningfully influence critical decisions, or whether competitive pressures ultimately override safety considerations. Current evidence suggests mixed results: while safety teams have influenced some deployment decisions and led to responsible scaling policies, they have also struggled to prevent concerning incidents like the OpenAI board crisis of November 2023 or the dissolution of OpenAI's Superalignment team in 2024.

Quick Assessment

| Dimension | Assessment | Evidence |
|---|---|---|
| Tractability | Medium | Significant barriers to entry; influence often limited by commercial pressures |
| Neglectedness | Low | Well-funded with competitive compensation; 1,500-2,500 people in safety-relevant roles at frontier labs |
| Scale of Impact | Potentially High | Direct proximity to critical decisions, but influence often overridden |
| Counterfactual | Uncertain | Would roles be filled by less safety-conscious candidates? Evidence unclear |
| Career Capital | High | Technical skills, network access, and inside knowledge remain valuable regardless |
| Moral Hazard | Significant | Legitimization of racing dynamics; perspective capture risks |

Strategic Landscape and Mechanisms

Working Inside Frontier Labs

The most direct form of corporate influence involves joining frontier AI labs, particularly in safety-focused roles. This approach has grown significantly since 2020, with major labs now employing hundreds of people on safety-related work.

Frontier AI Lab Safety Staff (2024-2025)

| Lab | Total Staff | Safety Team Size | Safety % | Notable Changes | Risk Management Score |
|---|---|---|---|---|---|
| Anthropic | ≈1,100 (2025) | 150-300 (estimated) | 15-25% | Grew from 300 to 950 in 2024; intentionally slowed hiring | 35% (highest) |
| Google DeepMind | ≈6,600 | 30-50 AGI alignment + additional safety teams | ≈1-2% | New AI Safety and Alignment org formed Feb 2024; team grew 37% in 2024 | 20% |
| OpenAI | ≈4,400 (2025) | ~16 AGI safety (down from ≈30) | <1% | Superalignment team disbanded May 2024; nearly 50% of safety staff departed | 33% |
| Meta AI | ≈3,000+ | Unknown | Unknown | Minimal public safety commitments | 22% |
| xAI | ≈200-400 | Unknown | Unknown | No public safety framework | Not rated |

Risk management scores from SaferAI assessment; "No AI company scored better than 'weak.'"

Anthropic self-describes as "an AI safety and research company" and maintains dedicated Interpretability, Alignment, Societal Impacts, and Frontier Red Teams. Google DeepMind has an AGI Safety Council led by co-founder Shane Legg, plus a Responsibility and Safety Council and Ethics and Society unit. OpenAI's safety landscape remains concerning following the dissolution of its Superalignment team in May 2024 and the departure of key safety leaders including Ilya Sutskever, Jan Leike, and Miles Brundage.

Safety roles typically fall into several categories, each with different risk-benefit profiles. Core safety researchers work on alignment, interpretability, and evaluation problems with direct access to frontier models. Their influence comes through developing safety techniques, informing responsible scaling policies, and providing technical input on deployment decisions.

Compensation at Frontier AI Labs (2024)

| Role Type | Anthropic | OpenAI | DeepMind | Notes |
|---|---|---|---|---|
| Research Scientist | $115K-$160K | $100K-$165K | $100K-$150K | Interpretability, alignment focus |
| Research Engineer | $115K-$190K | $130K-$150K | $100K-$150K | Up to $190K at Anthropic for senior |
| Software Engineer | $100K-$159K | ≈$150K | ≈$100K | High variance based on seniority |
| Policy/Trust & Safety | $150K-$198K | $100K-$150K | $150K-$180K | Lower than technical roles |
| Median Total Comp | $145K | ≈$100K | ≈$100K | Includes base, bonus, equity |

Sources: Levels.fyi, AI Paygrades, company job postings. Negotiation can increase offers 30-77%.

Safety-adjacent roles include policy positions that shape lab stances on regulation, communications roles that frame AI safety for public consumption, and security positions preventing model theft and misuse. These roles carry lower complicity risks since they don't directly advance capabilities, but also typically have less technical influence over core safety decisions.

The most controversial category involves capabilities researchers and engineers who directly advance AI performance. Some safety advocates argue these roles are net negative regardless of individual intentions, since they accelerate the timeline to potentially dangerous systems. Others contend that having safety-conscious people in capabilities roles is crucial for ensuring safety considerations are integrated into fundamental research directions rather than bolted on afterward.

Evidence for insider influence comes from several documented cases. Safety teams influenced the delayed release of GPT-4 in 2023, conducted extensive red-teaming that identified concerning capabilities, and contributed to the development of responsible scaling policies at multiple labs. However, the limits of this influence were also demonstrated during OpenAI's November 2023 board crisis, where safety concerns about rushing deployment were ultimately overridden by investor and employee pressure to reinstate Sam Altman.

Corporate Influence Pathways

```mermaid
flowchart TD
  subgraph Internal["Internal Influence"]
      SAFETY[Safety Team] --> DEPLOY[Deployment Decisions]
      SAFETY --> RSP[Responsible Scaling Policies]
      SAFETY --> EVAL[Model Evaluations]
      CAPS[Capabilities Staff] --> CULTURE[Safety Culture]
  end

  subgraph External["External Pressure"]
      INVESTOR[Investors] --> BOARD[Board/Governance]
      WHISTLE[Whistleblowing] --> MEDIA[Media/Public]
      REG[Regulators] --> COMPLIANCE[Compliance Requirements]
  end

  subgraph Outcomes["Outcomes"]
      DEPLOY --> DELAY[Delayed Releases]
      BOARD --> LEADERSHIP[Leadership Changes]
      MEDIA --> PRESSURE[Public Pressure]
      COMPLIANCE --> INVEST[Safety Investment]
  end

  BOARD --> DEPLOY
  PRESSURE --> BOARD

  style SAFETY fill:#cfc
  style WHISTLE fill:#ffc
  style INVESTOR fill:#ccf
```

Shareholder Activism and Governance Pressure

Shareholder activism remains largely untapped due to the private nature of most frontier labs, but presents significant theoretical leverage.

The OpenAI Board Crisis (November 2023): A Case Study

On November 17, 2023, OpenAI's board removed CEO Sam Altman, citing concerns that he was "not consistently candid in his communications" and steering the company away from its safety-focused mission. Within five days, he was reinstated after massive investor and employee pressure:

| Day | Event | Key Actors |
|---|---|---|
| Nov 17 | Board removes Altman; cites safety concerns | Board (Toner, McCauley, Sutskever, D'Angelo) |
| Nov 18 | Microsoft and investors press for reinstatement | Microsoft ($10B+ invested), Thrive Capital |
| Nov 19 | Emmett Shear named interim CEO; board holds firm | OpenAI board |
| Nov 20 | 700+ of 770 employees sign letter threatening resignation | 90% of staff |
| Nov 22 | Altman reinstated; board reconstituted | New board excludes Toner, McCauley; Sutskever later departs |

Key lesson: Investor pressure and employee revolt overwhelmed governance structures explicitly designed to prioritize safety. The board members who orchestrated the removal—except D'Angelo—were replaced. Sutskever, who initially supported the removal, departed in May 2024 to found Safe Superintelligence Inc.

Most frontier labs remain private or are subsidiaries of larger companies, limiting direct shareholder pressure. Anthropic is privately held with significant investment from Google and Amazon. OpenAI operates under an unusual capped-profit structure but remains largely privately controlled. Only Google (parent of DeepMind) and Microsoft (OpenAI's key partner) are fully public companies where traditional shareholder activism could apply, but AI represents a small fraction of their overall business.

The potential for shareholder influence may increase as the AI industry matures. Bloomberg Intelligence projects global ESG assets will exceed $40 trillion by 2030, up from roughly $30 trillion in 2022. Over half of global institutional assets are now managed by signatories to the UN Principles for Responsible Investment. However, the popularity of major tech stocks among ESG investors began cooling in 2023 after AI data center energy demands raised environmental concerns.

Effective shareholder activism would require coordinated efforts across multiple investor types: pension funds concerned about long-term stability, ESG-focused funds emphasizing governance, and individual investors willing to file shareholder resolutions. The key challenge lies in aligning investor incentives with safety outcomes rather than purely financial returns.

Whistleblowing and Transparency Mechanisms

Whistleblowing represents perhaps the highest-risk, highest-potential-impact form of corporate influence. Current legal protections for AI whistleblowers remain weak, with limited precedent for protection. However, 2024 saw unprecedented activity in AI whistleblowing.

Key Whistleblowing Events (2024)

| Date | Event | Actors | Outcome |
|---|---|---|---|
| May 2024 | Jan Leike resigns, posting that "safety culture has taken a backseat to shiny products" | Jan Leike (Superalignment lead) | Joined Anthropic; significant media coverage |
| June 2024 | Open letter from 13 AI workers on safety risks and whistleblower protections | 11 OpenAI + 2 DeepMind employees | Catalyzed legislative action |
| July 2024 | Anonymous SEC whistleblower complaint alleging illegal NDAs | Anonymous | SEC investigation; Congressional letters to OpenAI |
| Aug 2024 | Daniel Kokotajlo reveals ≈50% of AGI safety staff departed | Former OpenAI researcher | Confirmed safety team exodus |
| Oct 2024 | Miles Brundage resigns; AGI Readiness team disbanded | Miles Brundage | Another major safety departure |

Effective whistleblowing faces several structural challenges. The SEC whistleblower complaint alleged four violations in OpenAI's employment agreements: non-disparagement clauses lacking SEC disclosure exemptions, requiring company consent for federal disclosures, confidentiality requirements covering agreements with embedded violations, and requiring employees to waive SEC whistleblower compensation. OpenAI spokesperson Hannah Wong stated the company would remove nondisparagement terms from future exit paperwork.

Legislative Response: AI Whistleblower Protection Act

In response to these events, Senate Judiciary Committee Chair Chuck Grassley introduced the AI Whistleblower Protection Act, a bipartisan bill co-sponsored by Senators Coons (D-Del.), Blackburn (R-Tenn.), Klobuchar (D-Minn.), Hawley (R-Mo.), and Schatz (D-Hawai'i). Key provisions include:

  • Prohibition on retaliation against employees reporting AI safety failures
  • Relief mechanisms including reinstatement, double back pay, and compensatory damages
  • Complaint process through Department of Labor with federal court appeals
  • Explicit protection for communications to Congress and federal agencies

The bill received support from 22 organizations, including the National Whistleblower Center, but it has not been enacted.

Current Deployment and Quantitative Assessment

The direct corporate influence approach has grown substantially since 2020, driven by increased recognition of AI risks and significant funding for safety work. Current estimates suggest 1,500-2,500 people globally work in safety-relevant positions at frontier AI labs, though this depends heavily on how "safety-relevant" is defined.

Talent Flow Dynamics

A notable asymmetry has emerged in talent flows between labs. According to industry analyses, engineers are 8x more likely to leave OpenAI for Anthropic than the reverse. Key researchers who departed OpenAI—including Jan Leike, Chris Olah, and other founding members—joined Anthropic, which has positioned itself as the "safety-first" alternative. This suggests safety culture may be a significant factor in talent decisions.

The geographical distribution of safety roles remains heavily concentrated, with approximately 60% in the San Francisco Bay Area, 25% in London (primarily DeepMind), and 15% distributed across other locations including New York, Boston, and remote positions. This concentration creates both advantages (critical mass of expertise) and risks (groupthink and similar perspectives).

Counterfactual Impact Assessment

Assessment of counterfactual impact remains highly uncertain. The key questions for career decisions include:

| Question | Optimistic View | Pessimistic View | Current Evidence |
|---|---|---|---|
| Would someone less safety-conscious fill the role? | Yes—talent is scarce | No—labs would hire fewer | Limited data; likely varies by lab |
| Do safety teams influence critical decisions? | Yes—GPT-4 delays, RSPs | No—commercial pressure dominates | Mixed; OpenAI crisis suggests limits |
| Does working at labs provide legitimacy? | Minimal effect | Yes—signals responsible development | Plausibly significant |
| Is perspective capture a real risk? | Minimal—people maintain values | Yes—financial and social incentives | Anecdotal reports both ways |
| Are skills transferable to other safety work? | Yes—technical and network value | Partially—some lock-in | Generally positive |

Career progression data shows relatively high retention in safety roles (80-85% after two years) compared to capabilities research (70-75%), suggesting either greater job satisfaction or fewer alternative opportunities. However, this may change as the independent safety research ecosystem grows and provides more exit opportunities for lab employees.

Safety Implications and Risk Assessment

The direct corporate influence approach presents both significant opportunities and concerning risks for AI safety. On the promising side, safety teams have demonstrably influenced critical deployment decisions. The staged release of GPT-4, extensive red-teaming programs, and development of responsible scaling policies all reflect safety input into lab operations. These interventions may have prevented premature deployment of dangerous capabilities or at minimum slowed development timelines.

Responsible scaling policies represent perhaps the most significant positive development. Anthropic's AI Safety Level framework creates explicit thresholds for enhanced safety measures as capabilities increase. If models reach concerning capability levels (like advanced biological weapons design), the policy triggers enhanced security measures, testing requirements, and potentially deployment pauses. Similar frameworks at DeepMind and other labs suggest growing acceptance of structured approaches to safety-performance tradeoffs.
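The gating logic these frameworks describe can be sketched in a few lines. The following is an illustrative sketch only: capability names, scores, and thresholds are invented for this example and do not reflect Anthropic's actual ASL criteria or any lab's real evaluation suite.

```python
# Hypothetical sketch of a responsible-scaling-policy gate: if any capability
# evaluation crosses its threshold, enhanced measures apply before deployment.
from dataclasses import dataclass

@dataclass
class EvalResult:
    capability: str  # e.g. "bio_uplift" (hypothetical evaluation name)
    score: float     # normalized 0-1 score from a capability evaluation

# Invented thresholds above which enhanced safety measures are triggered
THRESHOLDS = {"bio_uplift": 0.5, "autonomous_replication": 0.4}

def required_measures(results: list[EvalResult]) -> list[str]:
    """Return the safety measures triggered by a set of evaluation results."""
    measures = []
    for r in results:
        limit = THRESHOLDS.get(r.capability)
        if limit is not None and r.score >= limit:
            measures.append(f"enhanced security + pre-deployment review: {r.capability}")
    return measures

results = [EvalResult("bio_uplift", 0.62), EvalResult("autonomous_replication", 0.10)]
triggered = required_measures(results)
if triggered:
    print("Deployment paused pending:", triggered)
```

The substantive difficulty, of course, is not the conditional logic but agreeing on thresholds in advance and honoring the pause when a commercial deadline is at stake.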

However, the approach also carries substantial risks that critics argue may outweigh benefits. The legitimacy provided by safety teams may accelerate dangerous development by making it appear responsible and well-governed. Talented safety researchers joining labs signals to investors, regulators, and the public that risks are being managed, potentially reducing pressure for external governance or more fundamental changes to development practices.

Competitive dynamics pose perhaps the greatest challenge to internal safety influence. Even well-intentioned labs face pressure to match competitors' capabilities and deployment timelines. Safety concerns that might delay products or limit capabilities face strong internal resistance when competitors appear to be racing ahead. The OpenAI board crisis demonstrated how even governance structures explicitly designed to prioritize safety can be overwhelmed by commercial pressure.

Perspective capture represents a more subtle but potentially serious risk. Employees of AI labs naturally develop inside views that may systematically underestimate risks or overestimate the effectiveness of safety measures. The social environment, financial incentives, and professional relationships all create pressure to view lab activities favorably. Some former lab employees report that concerns that seemed urgent from the outside appeared less pressing from the inside, though they disagreed about whether this reflected better information or problematic bias.

Recent safety team departures highlight the limits of internal influence. Jan Leike's resignation statement that "safety culture has taken a backseat to shiny products" at OpenAI suggests that even senior safety leaders can feel their influence is insufficient. Similar concerns have been reported at other labs, though usually more privately.

Future Trajectory and Development Scenarios

Near-term Development (1-2 years)

The landscape for direct corporate influence will likely evolve significantly in the near term as AI capabilities advance and regulatory pressure increases. Safety team sizes are expected to grow 50-100% across major labs, driven by both increasing recognition of risks and potential regulatory requirements for safety staff. However, this growth may be outpaced by expansion in capabilities research, potentially reducing safety teams' relative influence.

Regulatory developments will significantly shape the effectiveness of corporate influence approaches. The EU AI Act's requirements for high-risk AI systems may force labs to invest more heavily in safety infrastructure, while potential US legislation could mandate safety testing and disclosure requirements. These external requirements could strengthen the hand of internal safety advocates by providing regulatory backing for safety measures that might otherwise be overruled by competitive pressure.

The privateness of most frontier labs represents a major limiting factor for shareholder activism, but this may change. Several labs are reportedly considering public offerings or major funding rounds that could create opportunities for investor pressure. The growing interest from ESG-focused funds and pension funds in AI governance could create significant pressure if appropriate mechanisms exist.

Whistleblowing may become more common and effective as legal protections develop and public interest in AI safety increases. Several jurisdictions are considering AI-specific whistleblower protections, while media coverage of AI safety has grown substantially, creating more opportunities for impactful disclosure of concerning practices.

Medium-term Evolution (2-5 years)

Over a 2-5 year horizon, the effectiveness of direct corporate influence will depend heavily on how competitive dynamics and regulatory frameworks evolve. If international coordination on AI development emerges, internal safety advocates could gain significantly more influence by having external backing for safety measures. Conversely, if competition intensifies further, internal pressure to prioritize capabilities over safety may increase.

The maturation of AI capabilities will test responsible scaling policies and other safety frameworks developed by corporate safety teams. If models begin demonstrating concerning capabilities like advanced biological weapons design or autonomous research capability, the effectiveness of current safety measures will become apparent. Success in managing these transitions could validate the corporate influence approach, while failures might discredit it.

Public market dynamics may become increasingly relevant as more AI companies go public or mature funding markets develop. This could enable more traditional forms of shareholder activism and corporate governance pressure. However, it might also increase short-term pressure for financial returns that conflicts with long-term safety considerations.

The independent AI safety ecosystem is likely to mature significantly, providing more attractive exit opportunities for lab employees and potentially changing recruitment dynamics. If organizations like Redwood Research, ARC, or new government AI safety institutions can offer competitive compensation and resources, they may attract talent away from frontier labs or provide credible outside options that strengthen negotiating positions.

Key Uncertainties and Research Priorities

Several critical uncertainties determine the ultimate effectiveness of direct corporate influence approaches. The question of net impact remains fundamentally unresolved: does working at frontier labs reduce existential risk by improving safety practices, or increase risk by accelerating development and providing legitimacy to dangerous racing dynamics?

Measurement challenges complicate assessment of impact. Unlike some other safety interventions, it's difficult to quantify the counterfactual effects of safety team work. When a concerning capability is identified during testing, how much does this reduce ultimate risk compared to discovering it after deployment? When a safety team influences deployment decisions, how much additional risk reduction does this provide beyond what would have occurred anyway due to reputational concerns or liability issues?
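One way to make this measurement question concrete is a toy expected-value model. All numbers below are invented for illustration; the point is only that the counterfactual value of a safety team depends on how much it raises the pre-deployment detection rate and how much worse post-deployment discovery is.

```python
# Toy model: expected harm from a dangerous capability, with and without a
# safety team that improves pre-deployment detection. Numbers are illustrative.
def expected_harm(p_capability: float, p_detect_predeploy: float,
                  harm_predeploy: float, harm_postdeploy: float) -> float:
    """Expected harm given the probability the capability exists, the chance
    it is caught before deployment, and the harm in each discovery scenario."""
    caught = p_capability * p_detect_predeploy * harm_predeploy
    missed = p_capability * (1 - p_detect_predeploy) * harm_postdeploy
    return caught + missed

# Assume a 10% chance the capability exists, harm 100x larger if discovered
# only after release, and detection rates of 80% (with team) vs 30% (without).
with_team = expected_harm(0.10, 0.80, 1.0, 100.0)      # 2.08
without_team = expected_harm(0.10, 0.30, 1.0, 100.0)   # 7.03
print(f"counterfactual reduction: {without_team - with_team:.2f}")  # 4.95
```

Even in this stylized form, the model shows why estimates diverge so widely: the answer is dominated by the two quantities hardest to observe, the baseline detection rate without a dedicated team and the harm ratio between pre- and post-deployment discovery.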

The durability of safety culture improvements remains highly uncertain. Current safety investments might represent genuine long-term commitments to responsible development, or they might be temporary responses to public and regulatory pressure that could erode when that pressure diminishes or competitive dynamics intensify. The speed of potential culture change in either direction is also unclear.

Regulatory development trajectories will significantly impact the relative value of different corporate influence approaches. Strong regulatory frameworks with meaningful enforcement could make internal safety advocacy much more effective by providing external backing. Weak or captured regulatory frameworks might make internal influence less valuable relative to other interventions.

Key research priorities include developing better methods for measuring safety team impact, analyzing the conditions under which internal safety advocates maintain influence over critical decisions, and understanding how competitive dynamics affect the sustainability of safety investments. Comparative analysis of safety culture across different labs and tracking changes over time could provide important insights for career decisions and strategic planning.


Direct corporate influence represents a high-stakes, morally complex approach to AI safety that may prove either essential or counterproductive depending on implementation details and external factors. Its ultimate effectiveness will likely depend on maintaining genuine safety influence within labs while avoiding the legitimization of dangerous racing dynamics—a balance that remains challenging to achieve.


Sources & Further Reading

Primary Sources

  • CNBC (May 2024): OpenAI dissolves Superalignment AI safety team - Comprehensive coverage of the dissolution and Jan Leike's departure
  • Washington Post (July 2024): OpenAI illegally barred staff from airing safety risks, whistleblowers say - SEC whistleblower complaint details
  • Bloomberg (November 2023): 90% of OpenAI Staff Threaten to Go to Microsoft If Board Doesn't Quit - The employee revolt during the Altman crisis
  • Fortune (August 2024): OpenAI Exodus: Nearly half of AGI safety team gone - Daniel Kokotajlo's revelations about safety staff departures

Legislative & Policy

  • Senate Judiciary Committee: Grassley Introduces AI Whistleblower Protection Act - Full text and co-sponsors of the AI WPA
  • Institute for Law & AI: Protecting AI whistleblowers - Analysis of legal protections needed

Industry Analysis

  • Levels.fyi: Anthropic Salaries - Verified compensation data
  • SaferAI: Risk management assessment of frontier AI companies (35% highest score for Anthropic; "no company scored better than 'weak'")
  • Bloomberg Intelligence: Global ESG assets predicted to exceed $40 trillion by 2030

Career Resources

  • 80,000 Hours: Nick Joseph on Anthropic's safety approach - Inside perspective on RSPs
  • IAPS: Mapping Technical Safety Research at AI Companies - Comparative analysis of lab safety work
  • AI Lab Watch: Commitments tracker - Monitoring lab safety commitments

References

In October 2024, OpenAI disbanded its 'AGI Readiness' team and lost its head policy and safety advisor Miles Brundage to resignation. This continued a pattern of safety team dissolutions and prominent researcher departures, fueling concerns that OpenAI is deprioritizing safety as it accelerates toward AGI development.

★★★☆☆
2. Open letter from 13 AI workers (whistleblowersblog.org)

Thirteen current and former employees from OpenAI and other frontier AI labs published an open letter raising concerns about inadequate safety oversight and insufficient whistleblower protections within AI companies. The letter calls for stronger mechanisms enabling employees to report safety concerns without fear of retaliation, and advocates for greater transparency and accountability from AI developers. It highlights a gap between public safety commitments and internal company culture.

Following the surprise firing of Sam Altman by OpenAI's board in November 2023, over 700 of OpenAI's approximately 770 employees signed an open letter threatening to resign and join Microsoft unless the board resigned and reinstated Altman. This mass employee action was a pivotal moment in the OpenAI governance crisis, ultimately contributing to Altman's reinstatement.

★★★★☆

Bloomberg reported on the intense pressure from Microsoft and OpenAI investors on the board to reinstate Sam Altman as CEO following his sudden firing in November 2023. The article covered the high-stakes standoff between the board and major stakeholders that ultimately led to Altman's return within days.

★★★★☆

This page from the Law-AI organization analyzes the AI Whistleblower Protection Act, a bipartisan U.S. Senate bill introduced in 2024 to protect AI industry employees who disclose safety-related information from employer retaliation. It contextualizes the legislation within the OpenAI exit-contract controversy and the 'right to warn' open letter, arguing that whistleblower protections are a low-cost, politically viable governance tool to help governments access critical safety information from those closest to frontier AI development.

OpenAI disbanded its Superalignment team in May 2024, less than a year after launching it with a pledge of 20% compute resources toward controlling advanced AI. The dissolution followed the departures of team leaders Ilya Sutskever and Jan Leike, with Leike publicly criticizing OpenAI's safety culture as subordinated to product development.

★★★☆☆

Safe Superintelligence Inc. (SSI) is a lab founded by Ilya Sutskever and others with the singular goal of building safe superintelligence. The company claims to approach safety and capabilities as joint technical problems, aiming to keep safety ahead of capabilities as they scale. Their model is explicitly designed to avoid short-term commercial pressures that might compromise safety priorities.

A law firm press release covers an anonymous SEC whistleblower complaint (July 2024) alleging that OpenAI used illegal restrictive NDAs and non-disparagement agreements to prevent employees from reporting AI safety concerns to federal regulators. The complaint, initially obtained by the Washington Post from a Congressional source, calls for an SEC investigation into OpenAI's employment practices and demands reforms to ensure transparency and accountability.

SaferAI is an organization that provides safety assessments and evaluations of AI systems and developers, aiming to benchmark and publicly report on how well AI labs adhere to safety practices. It serves as an independent evaluation body helping stakeholders understand the safety posture of AI organizations.

Senator Chuck Grassley introduced bipartisan legislation to provide explicit federal protections for AI company employees who report safety concerns to government or Congress, directly addressing how restrictive NDAs and severance agreements silence potential whistleblowers. The bill merges existing AI oversight and whistleblower protection frameworks, offering remedies including reinstatement, back pay, and damages for retaliation.

AI Lab Watch's Commitments Tracker monitors and evaluates the public safety commitments made by major AI laboratories, tracking whether frontier AI companies are honoring pledges related to safety, governance, and responsible deployment. It serves as an accountability tool by systematically documenting what labs have promised and assessing follow-through.

Fortune reports on Daniel Kokotajlo's revelations that approximately 50% of OpenAI's AGI safety-focused researchers have departed the organization. The exodus raises serious concerns about OpenAI's commitment to safety as it accelerates toward AGI development, with departing researchers citing misalignment between stated safety priorities and actual organizational behavior.

★★★☆☆

Bloomberg Intelligence forecasts that global ESG (Environmental, Social, and Governance) assets under management will reach $40 trillion by 2030, despite a challenging regulatory and political environment. The report highlights continued institutional investor interest in sustainable finance even amid ESG backlash in some markets. This projection underscores the growing financial weight of ESG considerations in capital allocation decisions.

★★★★☆

AI Paygrades is a crowdsourced salary transparency platform aggregating compensation data for roles at major AI laboratories and tech companies. It provides salary benchmarks for researchers, engineers, and other staff at frontier AI labs, helping workers understand market rates and negotiate compensation. The site contributes to transparency around the economics of AI talent.

Nick Joseph, a researcher at Anthropic, discusses the company's approach to AI safety including their Responsible Scaling Policy, how they think about evaluating model capabilities and risks, and the internal culture around safety at Anthropic. The conversation covers practical mechanisms for slowing or pausing AI development if safety thresholds are breached.

★★★☆☆

Aggregated salary data for Anthropic employees across various roles and levels, sourced from crowdsourced submissions on levels.fyi. Provides compensation benchmarks for engineering, research, and other positions at the AI safety company. Useful for understanding the talent market and compensation norms at a leading AI safety-focused frontier lab.

17. Mapping Technical Safety Research at AI Companies (Institute for AI Policy and Strategy)

An IAPS analysis that maps and categorizes the technical AI safety research being conducted across major AI companies, identifying what areas are being prioritized, where gaps exist, and how industry research agendas compare. It provides a structured overview of the technical safety landscape within frontier AI labs.

★★★★☆

Senator Chuck Grassley introduced the AI Whistleblower Protection Act to provide explicit legal protections for current and former AI employees who report safety concerns to federal authorities. The bill targets restrictive NDAs and severance agreements that silence AI workers, merging existing AI and whistleblower statutes to provide remedies including reinstatement, back pay, and damages for retaliation. The bipartisan legislation has broad congressional and advocacy group support.

Jan Leike, co-lead of OpenAI's Superalignment team, publicly resigned in May 2024, stating that safety culture and processes had been deprioritized in favor of product development. His departure, alongside Ilya Sutskever, marked a significant exodus of safety-focused leadership from OpenAI. Leike's public statement raised concerns about whether OpenAI was living up to its stated safety commitments.

20. "Filed an SEC complaint" (The Washington Post)

A former OpenAI employee filed a complaint with the SEC alleging that the company withheld information about AI safety risks from investors and regulators. The whistleblower claimed OpenAI's rapid capability development outpaced its safety measures and that internal concerns were suppressed. This represents a significant instance of formal regulatory escalation over AI safety culture at a leading frontier lab.

★★★★☆

Related Wiki Pages

Top Related Pages

Approaches

Responsible Scaling Policies · Corporate AI Safety Responses · AI Safety Training Programs · AI Lab Safety Culture

Other

Sam Altman · Jan Leike · Chris Olah · Ilya Sutskever · Shane Legg · Miles Brundage

Analysis

AI Lab Whistleblower Dynamics Model · AI Safety Culture Equilibrium Model

Policy

Voluntary AI Safety Commitments · AI Whistleblower Protections

Organizations

METR · US AI Safety Institute

Key Debates

AI Safety Solution Cruxes

Safety Research

Anthropic Core Views

Historical

Anthropic-Pentagon Standoff (2026)