Skip to content
Longterm Wiki
Index
Benchmark Result·npuCCe8ZKX·Record

sid_kWPQCvjKSg / HumanEval: 89

Verdictconfirmed95%
1 check · 4/29/2026

1 → confirmed

Our claim

entire record
Benchmark
vxX2rorgxU
Model
Llama
Score
89
Unit
pass@1
Date
July 23, 2024
Notes
Llama 3.1 405B Instruct, 0-shot evaluation
Tested By
unknown

Source evidence

1 src · 1 check
Case № npuCCe8ZKXFiled 4/29/2026Confidence 95%
Source Check: sid_kWPQCvjKSg / HumanEval: 89 | Longterm Wiki