Longterm Wiki

Benchmarking and comparing different evaluation awareness metrics

Needs attention
$3K
Funder
Recipient
Hieu Minh Nguyen
Program
Date
Aug 2025
Data source
Source
Notes

[Technical AI safety, AI governance] LLMs often know when they are being evaluated. We’ll do a study comparing various methods to measure and monitor this capability.

Benchmarking and comparing different evaluation awareness metrics | Manifund | Grants | Longterm Wiki