Longterm Wiki

DeepSeek Models on DROP: 91.6

benchmark-result (Verified)

Child of DROP

Metadata

Source Table: benchmark_results
Source ID: MiLn2eVl9N
Parent: DROP
Children:
Created: Apr 24, 2026, 6:54 PM
Updated: Apr 24, 2026, 6:54 PM
Synced: Apr 24, 2026, 6:54 PM

Record Data

id: MiLn2eVl9N
benchmarkId: cejlbJN241
modelId: DeepSeek Models (ai-model)
score: 91.6
unit: percent
date: 2024-12-01
sourceUrl:
notes: DeepSeek-V3 DROP F1 score, 3-shot evaluation
testedBy: unknown
testedByOrgId:
evaluationDate:
methodologyNotes:

Source Check Verdicts

confirmed (99% confidence)

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: MiLn2eVl9N

Parent Thing ID: cejlbJN241