Eliciting Latent Knowledge
Scalable OversightactiveExtracting what an AI model 'actually believes' rather than what it says, addressing the distinction between model knowledge and model outputs.
Organizations
4
Key Papers
3
Grants
3
Total Funding
$181K
First Proposed: 2022 (Christiano et al., ARC)
Cluster: Scalable Oversight
Parent Area: Scalable Oversight
Tags
elkknowledge-elicitationalignment
Organizations1
| Organization | Role |
|---|---|
| Alignment Research Center | pioneer |
Grants3
| Name | Recipient | Amount | Funder | Date |
|---|---|---|---|---|
| Grant to "support a competition for work on Eliciting Latent Knowledge, an open problem in AI alignment, for talented high school and college students who are participating in Prometheus Science Bowl." | Prometheus Science Bowl | $100K | FTX Future Fund | 2022-05 |
| A research & networking retreat for winners of the Eliciting Latent Knowledge contest | - | $72K | Long-Term Future Fund (LTFF) | 2022-10 |
| 3 months relocation from Chad to London to work on Eliciting Latent Knowledge with Jake Mendel from Apollo Research | Sienka Dounia | $8.5K | Long-Term Future Fund (LTFF) | 2024-01 |
Funding by Funder
| Funder | Grants | Total Amount |
|---|---|---|
| FTX Future Fund | 1 | $100K |
| Long-Term Future Fund (LTFF) | 2 | $81K |
Key Papers & Resources1
SEMINAL
Eliciting Latent Knowledge
Christiano et al. (ARC)2021