CAISI Evaluation of DeepSeek AI Models Finds Shortcomings and Risks
governmentCredibility Rating
Gold standard. Rigorous peer review, high editorial standards, and strong institutional reputation.
Rating inherited from publication venue: NIST
This NIST/CAISI report is a government-authored comparative safety and performance evaluation of Chinese AI models, relevant to AI governance, deployment risk, and geopolitical dimensions of AI safety.
Metadata
Summary
NIST's Center for AI Standards and Innovation (CAISI) evaluated DeepSeek AI models (R1, R1-0528, V3.1) against leading U.S. models across 19 benchmarks, finding DeepSeek significantly underperforms on technical metrics and cost-effectiveness. The report also identifies security vulnerabilities and systematic censorship in DeepSeek responses as risks to developers, consumers, and U.S. national security. The evaluation highlights concerns about the rapid global adoption of PRC-developed AI models spurred by DeepSeek's prominence.
Key Points
- •DeepSeek R1, R1-0528, and V3.1 were benchmarked against OpenAI and Anthropic models across 19 evaluation dimensions, with U.S. models outperforming on most metrics.
- •DeepSeek models exhibit security vulnerabilities that pose risks to developers and end users who deploy or interact with them.
- •Censorship behaviors embedded in DeepSeek's responses raise concerns about information integrity and geopolitical influence on AI outputs.
- •DeepSeek's rise has accelerated global adoption of PRC-developed AI, which CAISI flags as a U.S. national security consideration.
- •The evaluation is conducted by a U.S. federal standards body, giving it institutional weight in ongoing AI governance and policy discussions.
Cited by 2 pages
| Page | Type | Quality |
|---|---|---|
| Open vs Closed Source AI | Crux | 60.0 |
| Multipolar Trap (AI Development) | Risk | 91.0 |
Cached Content Preview
[Skip to main content](https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks#main-content)

**Official websites use .gov**
A **.gov** website belongs to an official government organization in the United States.

**Secure .gov websites use HTTPS**
A **lock** ( LockA locked padlock
) or **https://** means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks

[NEWS](https://www.nist.gov/news-events/news)
# CAISI Evaluation of DeepSeek AI Models Finds Shortcomings and Risks
September 30, 2025
## Share
[Facebook](https://www.facebook.com/share.php?u=https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks "Facebook")
[Linkedin](https://www.linkedin.com/shareArticle?mini=true&url=https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks&source=https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks "Linkedin")
[X.com](https://x.com/intent/tweet?url=https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks&status=https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks "X.com")
[Email](mailto:?subject=NIST.gov&body=Check%20out%20this%20site%20https://www.nist.gov/news-events/news/2025/09/caisi-evaluation-deepseek-ai-models-finds-shortcomings-and-risks "Email")
- AI models from developer DeepSeek were found to lag behind U.S. models in performance, cost, security and adoption.
- Security shortcomings and censorship may pose risks to application developers, consumers and U.S. national security.
- DeepSeek’s products are contributing to a rapid rise in the global use of models from the PRC.
WASHINGTON — The Center for AI Standards and Innovation (CAISI) at the Department of Commerce’s National Institute of Standards and Technology (NIST) evaluated AI models from the People’s Republic of China (PRC) developer DeepSeek and found they lag behind U.S. models in performance, cost, security and adoption.
“Thanks to President Trump’s AI Action Plan, the Department of Commerce and NIST’s Center for AI Standards and Innovation have released a groundbreaking evaluation of American vs. adversary AI,” said Secretary of Commerce Howard Lutnick. “The report is clear that American AI dominates, with DeepSeek trailing far behind. This weakness isn’t just technical. I
... (truncated, 7 KB total)ff1a185c3aa33003 | Stable ID: NTczYWMwZj