All Source Checks
Automated source checking of wiki data against original sources. Each record is checked against one or more external sources to confirm accuracy.
View internal dashboard with coverage & action queue →Verified Correct
7,562
74% of checked
Has Issues
1,363
13% of checked
Can't Verify
1,314
13% of checkedincl. 265 dead links
Not Yet Checked
0
of 10,239 total
Contradicted
141
Fix now — data may be wrong
Outdated
28
Source has newer info
Accuracy Rate
98%
confirmed / (confirmed + wrong + outdated)
Needs Recheck
0
All up to date
Alignment
hjKdxW_2Br
Y-CBzdxiQh
HhuRJVCdVl
h7ikcuxgmh
rP03GkZm34
PUVNckO-Y4
DL7sT_jlOo
N-sXs9r1Wh
mGU0aT9Q2L
1WJtj22mIG
cQSrJ_VPPf
nsu5NXJayV
paLZxJdHlX
LS8icm7VoX
CAIS Compute Cluster
AI and Society Fellowship
oiu44mtkax
JbYcfgVmQH
br7vCVZ1ME
aBhKPMXJYl
vC9x3Q6kMh
PQrBTM2skw
Research
Field-Building
CAIS Action Fund
Research
Compute Cluster
Samotsvety - Footnote 25
BPiW9qyHpQ
sid_ePVee3jidQ / MMMU: 69.1
sid_ePVee3jidQ / LiveCodeBench: 65.4
sid_ePVee3jidQ / GSM8K: 96.4
sid_ePVee3jidQ / MMLU-Pro: 78.4
sid_ePVee3jidQ / HumanEval: 94
sid_ISfAiImMYg / SWE-bench Verified: 49
sid_ISfAiImMYg / GSM8K: 96.4
sid_v1e1ZwDwoA / HumanEval: 30.5
sid_v1e1ZwDwoA / GSM8K: 40.3
sid_v1e1ZwDwoA / HellaSwag: 84
sid_v1e1ZwDwoA / MMLU: 60.1
sid_nnv09Wl5OQ / LiveCodeBench: 79.4
sid_nnv09Wl5OQ / Chatbot Arena Elo: 1402
sid_nnv09Wl5OQ / MMLU-Pro: 79.9
sid_nnv09Wl5OQ / HumanEval: 86.5
sid_nnv09Wl5OQ / GSM8K: 89.3
sid_nywmt9QdsA / MathVista: 73.1
sid_nywmt9QdsA / MMLU: 80.1
sid_Gqv7h9oEwA / HellaSwag: 95
sid_Gqv7h9oEwA / GSM8K: 92
| Type | Entity | Claim | Verdict | Confidence | Sources | Last Checked | |
|---|---|---|---|---|---|---|---|
| Division | - | Alignment | confirmed | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | hjKdxW_2Br | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | Y-CBzdxiQh | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | HhuRJVCdVl | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | h7ikcuxgmh | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | rP03GkZm34 | confirmed | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | PUVNckO-Y4 | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | DL7sT_jlOo | unverifiable | 100% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | N-sXs9r1Wh | confirmed | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | mGU0aT9Q2L | confirmed | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | 1WJtj22mIG | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | cQSrJ_VPPf | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | nsu5NXJayV | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | paLZxJdHlX | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Funding Round | - | LS8icm7VoX | confirmed | 95% | 1 | Apr 29, 2026 | |
| Division | - | CAIS Compute Cluster | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Division | - | AI and Society Fellowship | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | oiu44mtkax | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | JbYcfgVmQH | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Policy Stakeholder | - | br7vCVZ1ME | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Funding Round | - | aBhKPMXJYl | confirmed | 95% | 1 | Apr 29, 2026 | |
| Funding Round | - | vC9x3Q6kMh | confirmed | 95% | 1 | Apr 29, 2026 | |
| Funding Round | - | PQrBTM2skw | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Division | - | Research | unverifiable | 95% | 1 | Apr 29, 2026 | |
| Division | - | Field-Building | partial | 95% | 2 | Apr 29, 2026 | |
| Division | - | CAIS Action Fund | confirmed | 95% | 1 | Apr 29, 2026 | |
| Division | - | Research | partial | 95% | 2 | Apr 29, 2026 | |
| Division | - | Compute Cluster | confirmed | 95% | 1 | Apr 29, 2026 | |
| Citation | - | Samotsvety - Footnote 25 | unverifiable | 73% | 4 | Apr 29, 2026 | |
| Funding Round | - | BPiW9qyHpQ | unverifiable | 95% | 2 | Apr 29, 2026 | |
| Benchmark Result | Claude 3.7 Sonnet | sid_ePVee3jidQ / MMMU: 69.1 | confirmed | 99% | 1 | Apr 24, 2026 | |
| Benchmark Result | Claude 3.7 Sonnet | sid_ePVee3jidQ / LiveCodeBench: 65.4 | confirmed | 99% | 1 | Apr 24, 2026 | |
| Benchmark Result | Claude 3.7 Sonnet | sid_ePVee3jidQ / GSM8K: 96.4 | confirmed | 99% | 1 | Apr 24, 2026 | |
| Benchmark Result | Claude 3.7 Sonnet | sid_ePVee3jidQ / MMLU-Pro: 78.4 | confirmed | 99% | 1 | Apr 24, 2026 | |
| Benchmark Result | Claude 3.7 Sonnet | sid_ePVee3jidQ / HumanEval: 94 | confirmed | 99% | 1 | Apr 24, 2026 | |
| Benchmark Result | Claude 3.5 Sonnet | sid_ISfAiImMYg / SWE-bench Verified: 49 | confirmed | 98% | 1 | Apr 24, 2026 | |
| Benchmark Result | Claude 3.5 Sonnet | sid_ISfAiImMYg / GSM8K: 96.4 | confirmed | 95% | 1 | Apr 24, 2026 | |
| Benchmark Result | Mistral | sid_v1e1ZwDwoA / HumanEval: 30.5 | confirmed | 98% | 1 | Apr 24, 2026 | |
| Benchmark Result | Mistral | sid_v1e1ZwDwoA / GSM8K: 40.3 | confirmed | 98% | 1 | Apr 24, 2026 | |
| Benchmark Result | Mistral | sid_v1e1ZwDwoA / HellaSwag: 84 | confirmed | 95% | 1 | Apr 24, 2026 | |
| Benchmark Result | Mistral | sid_v1e1ZwDwoA / MMLU: 60.1 | confirmed | 98% | 1 | Apr 24, 2026 | |
| Benchmark Result | Grok | sid_nnv09Wl5OQ / LiveCodeBench: 79.4 | confirmed | 95% | 1 | Apr 24, 2026 | |
| Benchmark Result | Grok | sid_nnv09Wl5OQ / Chatbot Arena Elo: 1402 | confirmed | 95% | 1 | Apr 24, 2026 | |
| Benchmark Result | Grok | sid_nnv09Wl5OQ / MMLU-Pro: 79.9 | confirmed | 95% | 1 | Apr 24, 2026 | |
| Benchmark Result | Grok | sid_nnv09Wl5OQ / HumanEval: 86.5 | confirmed | 95% | 1 | Apr 24, 2026 | |
| Benchmark Result | Grok | sid_nnv09Wl5OQ / GSM8K: 89.3 | confirmed | 95% | 1 | Apr 24, 2026 | |
| Benchmark Result | GPT-4.1 mini | sid_nywmt9QdsA / MathVista: 73.1 | unverifiable | 85% | 1 | Apr 24, 2026 | |
| Benchmark Result | GPT-4.1 mini | sid_nywmt9QdsA / MMLU: 80.1 | confirmed | 98% | 1 | Apr 24, 2026 | |
| Benchmark Result | GPT | sid_Gqv7h9oEwA / HellaSwag: 95 | confirmed | 95% | 1 | Apr 24, 2026 | |
| Benchmark Result | GPT | sid_Gqv7h9oEwA / GSM8K: 92 | confirmed | 95% | 1 | Apr 24, 2026 |
Data from source_check_verdicts table. Click a row to view detailed evidence.