Longterm Wiki

Grant: UC Berkeley — Study on Frontier Model Behavior (Coefficient Giving → University of California, Berkeley)

Verdict: confirmed · 95%
1 check · 4/9/2026

Deterministic match: grantee, amount, date matched in source snapshot (2714 rows)

Our claim

entire record
Name
UC Berkeley — Study on Frontier Model Behavior
Amount
$499,597
Currency
USD
Date
June 2025
Notes
[Navigating Transformative AI] Open Philanthropy recommended a grant of $499,597 over three years to UC Berkeley to support a study on the behavior of frontier AI models. This work will be led by Professor Emma Pierson, expanding on her previous paper “Sparse Autoencoders for Hypothesis Generation”. In her original paper, Pierson used sparse autoencoders to analyze text data (like Yelp restaurant review scores) and identify text features that predict outcomes (like how many stars a user awards a restaurant). She then used an LLM to describe those text features as testable, natural-language hypotheses (like “words associated with quick service lead to higher review scores”). This grant will enable Professor Pierson to investigate whether the HypotheSAEs technique can be used to identify properties in the text of LLM prompts that cause AIs to generate harmful responses. The grant will also allow for the exploration of technical upgrades to the HypotheSAEs pipeline, like using Matryoshka SAEs or transcoders. This falls within our focus area of potential risks from advanced artificial intelligence.

Source evidence

1 src · 1 check
confirmed · 95% · deterministic-row-match · 4/9/2026
Name
UC Berkeley — Study on Frontier Model Behavior
Grantee
University of California, Berkeley
Focus Area
Navigating Transformative AI
Amount
$499,597.00
Date
June 2025

Note
Deterministic match: grantee, amount, date matched in source snapshot (2714 rows)

Case № h_XG5A3Ok5 · Filed 4/9/2026 · Confidence 95%