Skip to content
Longterm Wiki
grant

Mitigating Reward Hacking Through RL Training Interventions

Metadata

Source Tablegrants
Source IDbs34HNgJhu
Source URLmanifund.org/projects/mitigating-reward-hacking-through-rl-training-interventions
Children
CreatedMar 12, 2026, 4:59 PM
UpdatedMar 14, 2026, 6:22 AM
SyncedMar 12, 2026, 4:59 PM

Record Data

idbs34HNgJhu
organizationIdManifund(organization)
granteeIdAria Wong
orgEntityIdManifund(organization)
orgDisplayName
granteeEntityId
granteeDisplayNameAria Wong
nameMitigating Reward Hacking Through RL Training Interventions
amount7900
currencyUSD
period
date2026-02-18
status
sourcemanifund.org/projects/mitigating-reward-hacking-through-rl-training-intervention…
notesTechnical AI safety
programId8jnn54YEbQ
dataSourceId

Source Check Verdicts

confirmed95% confidence

Last checked: 4/9/2026

[deterministic-row-match] Deterministic match: name, amount matched in source snapshot (20000 rows)

Debug info

Thing ID: bs34HNgJhu

Source Table: grants

Source ID: bs34HNgJhu