Longterm Wiki

Grades from external scorecards. We mirror published grades only; per-source methodology lives at the link in each panel header. See the scorecards directory for cross-org comparison.

Foundation Model Transparency Index

by Stanford CRFMPublished 2025-12-01Source ↗

Overall:66

Agent Protocols: 100
AI bug bounty: 100
Amount of usage: 0
AUP enforcement frequency: 0
AUP enforcement process: 100
Basic model properties: 100
Benchmarked inference: 0
Benefits Assessment: 100
Capabilities evaluation: 100
Capabilities taxonomy: 100
Carbon emissions for final training run: 0
Change log: 100
Classification of usage data: 0
Code access: 0
Compute hardware for final training run: 100
Compute provider: 100
Compute usage for final training run: 0
Compute usage including R&D: 0
Consumer/enterprise usage: 100
Crawling: 100
Data acquisition methods: 100
Data domain composition: 0
Data laborer practices: 100
Data language composition: 0
Data processing methods: 100
Data processing purpose: 100
Data processing techniques: 0
Data replicability: 0
Data retention and deletion policy: 100
Data size: 0
Deeper model properties: 100
Detection of machine-generated content: 100
Development duration for final training run: 0
Distribution channels with usage data: 0
Documentation for responsible use: 100
Downstream: 75
Acceptable use policy: 80
Accountability: 100
Downstream mitigations: 100
Impact: 71.4
Model Behavior Policy: 100
Post-deployment monitoring: 71.4
Usage data: 20
Energy usage for final training run: 0
Enterprise mitigations: 100
Enterprise users: 100
External data access: 0
External developer mitigations: 100
External products and services: 0
External reproducibility of capabilities evaluation: 100
External reproducibility of mitigations evaluation: 0
External reproducibility of risks evaluation: 0
External risk evaluation: 100
Feedback mechanisms: 0
Foundation model roadmap: 100
Geographic statistics: 0
Government commitments: 100
Government use: 100
Instructions for data generation: 100
Intermediate tokens: 100
Internal compute allocation: 0
Internal product and service mitigations: 100
Internal products and services: 100
Licensed data compensation: 100
Licensed data sources: 0
Misuse incident reporting protocol: 100
Mitigations efficacy: 0
Mitigations taxonomy: 100
Mitigations taxonomy mapped to risk taxonomy: 100
Model: 70
Capabilities: 75
Model cost: 0
Model dependencies: 100
Model access: 50
Model information: 75
Model Mitigations: 60
Model objectives: 100
Release: 87.5
Model response characteristics: 100
Risks: 60
Model stages: 100
Model theft prevention measures: 100
New human-generated data sources: 100
Notice of usage data used in training: 100
Open weights: 100
Organization chart: 100
Oversight mechanism: 100
Permitted and prohibited users: 100
Permitted, restricted, and prohibited model behaviors: 100
Permitted, restricted, and prohibited uses: 100
Post-deployment coordination with government: 0
Pre-deployment risk evaluation: 0
Public datasets: 100
Quantization: 100
Regional policy variations: 100
Release stages: 100
Researcher credits: 0
Responsible disclosure policy: 100
Risks evaluation: 100
Risks taxonomy: 100
Risk thresholds: 100
Safe harbor: 100
Security incident reporting protocol: 100
Specialized access: 0
Synthetic data purpose: 100
Synthetic data sources: 100
System prompt: 100
Terms of use: 100
Top distribution channels: 0
Train-test overlap: 0
Upstream: 52.9
Compute: 22.2
Data Acquisition: 91.7
Data Processing: 66.7
Data Properties: 0
Methods: 66.7
Other resources: 50
Usage data used in training: 100
Users of internal products and services: 0
Versioning protocol: 100
Water usage for final training run: 0
Whistleblower protection: 100

Grade trajectory (3 waves)

v1.2 December 2025latest: 66
v1.1 May 2024: 148.5
v1.0 October 2023: 25

AI21 Labs