Skip to content
Longterm Wiki
All Source Checks
Publication

Publication: Measuring Massive Multitask Language Understanding (MMLU) by Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt (2020-09)

confirmed95% confidence

1 evidence check

Last checked: 4/3/2026

All key fields in the record are confirmed by the source text. The title, all seven authors in the correct order, and the publication date (2020-09, corresponding to arxiv submission September 2020) are explicitly stated. The arxiv URL format matches the standard pattern for arxiv papers. The publication type as 'paper' is appropriate for an arxiv preprint. No contradictions or discrepancies were found.

Evidence — 1 source, 1 check

confirmed95%Haiku 4.5 · 4/3/2026
Found: Title: 'Measuring Massive Multitask Language Understanding'; Authors: Dan Hendrycks, Collin Burns, Steven Basart, Andy Zou, Mantas Mazeika, Dawn Song, Jacob Steinhardt; Publication date: September 202

Note: All key fields in the record are confirmed by the source text. The title, all seven authors in the correct order, and the publication date (2020-09, corresponding to arxiv submission September 2020) are explicitly stated. The arxiv URL format matches the standard pattern for arxiv papers. The publication type as 'paper' is appropriate for an arxiv preprint. No contradictions or discrepancies were found.

Debug info

Record type: publication

Record ID: QNbZISNZtJ