Skip to content
Longterm Wiki

Mistral on MMLU: 60.1

benchmark-resultVerified

Child of MMLU

Metadata

Source Tablebenchmark_results
Source IDsl3OOLP0q8
ParentMMLU
Children
CreatedApr 24, 2026, 7:21 PM
UpdatedApr 24, 2026, 7:21 PM
SyncedApr 24, 2026, 7:21 PM

Record Data

idsl3OOLP0q8
benchmarkIdizV3Xk98se
modelIdMistral(ai-model)
score60.1
unitpercent
date2023-09-27
sourceUrl
notesMistral 7B Instruct on MMLU 5-shot scenario
testedByunknown
testedByOrgId
evaluationDate
methodologyNotes

Source Check Verdicts

confirmed98% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: sl3OOLP0q8

Source Table: benchmark_results

Source ID: sl3OOLP0q8

Parent Thing ID: izV3Xk98se