Child of Center for AI Safety (CAIS)
Metadata
| Source Table | publications |
| Source ID | QNbZISNZtJ |
| Description | Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt, 2020-09 |
| Source URL | arxiv.org/abs/2009.03300 |
| Parent | Center for AI Safety (CAIS) |
| Children | — |
| Created | Mar 23, 2026, 2:46 PM |
| Updated | Mar 23, 2026, 2:46 PM |
| Synced | Mar 23, 2026, 2:46 PM |
Record Data
id | QNbZISNZtJ |
entityId | Center for AI Safety (CAIS)(organization) |
entityDisplayName | — |
resourceId | — |
title | Measuring Massive Multitask Language Understanding (MMLU) |
authors | Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt |
url | arxiv.org/abs/2009.03300 |
venue | — |
publishedDate | 2020-09 |
publicationType | paper |
citationCount | — |
isFlagship | Yes |
abstract | — |
source | arxiv.org/abs/2009.03300 |
notes | Most widely used AI capability benchmark. ICLR 2021. |
Source Check Verdicts
confirmed95% confidence
Last checked: 4/29/2026
1 → confirmed
Debug info
Thing ID: QNbZISNZtJ
Source Table: publications
Source ID: QNbZISNZtJ
Parent Thing ID: sid_y4bieqSeag