Skip to content
Longterm Wiki
grant

Developing noise-injection methods to reveal and reduce deceptive behaviors in language models prior to deployment

Metadata

Source Tablegrants
Source IDanT41CIghc
Source URLfunds.effectivealtruism.org/grants
Children
CreatedMar 12, 2026, 5:54 AM
UpdatedMar 14, 2026, 6:22 AM
SyncedMar 12, 2026, 4:12 PM

Record Data

idanT41CIghc
organizationIdLong-Term Future Fund (LTFF)(organization)
granteeIdAdelin Kassler
orgEntityIdLong-Term Future Fund (LTFF)(organization)
orgDisplayName
granteeEntityId
granteeDisplayNameAdelin Kassler
nameDeveloping noise-injection methods to reveal and reduce deceptive behaviors in language models prior to deployment
amount40000
currencyUSD
period
date2024-07
status
sourcefunds.effectivealtruism.org/grants
sourceResourceId
notes[Long-Term Future Fund] Developing noise-injection methods to reveal and reduce deceptive behaviors in language models prior to deployment
programIdxng_1vsce_
dataSourceId

Source Check Verdicts

confirmed95% confidence

Last checked: 4/3/2026

[deterministic-row-match] Deterministic match: grantee, amount matched in source snapshot (1628 rows)

Debug info

Thing ID: anT41CIghc

Source Table: grants

Source ID: anT41CIghc