Skip to content
Longterm Wiki
Index
Benchmark Result·tVEnqaUPCi·Record

sid_kWPQCvjKSg / MATH: 73.8

Verdictconfirmed99%
1 check · 4/29/2026

1 → confirmed

Our claim

entire record
Benchmark
q6rR1sbyZG
Model
Llama
Score
73.8
Unit
percent
Date
July 23, 2024
Notes
Llama 3.1 405B Instruct, 0-shot chain-of-thought
Tested By
unknown

Source evidence

1 src · 1 check
Case № tVEnqaUPCiFiled 4/29/2026Confidence 99%
Source Check: sid_kWPQCvjKSg / MATH: 73.8 | Longterm Wiki