Longterm Wiki
Benchmark Result · I4H3zQ7VJK · Record

sid_tppPAkJqjQ / HumanEval: 92

Verdict: confirmed (99%)
1 check · 4/29/2026

1 → confirmed

Our claim

entire record
Benchmark: vxX2rorgxU
Score: 92
Unit: percent
Date: November 24, 2025
Notes: HumanEval - Python function implementation benchmark
Tested By: unknown

Source evidence

1 src · 1 check
confirmed (99%) · inline-submission · 4/24/2026
Case № I4H3zQ7VJK · Filed 4/29/2026 · Confidence 99%