Skip to content
Longterm Wiki
Back

Credibility Rating

4/5
High(4)

High quality. Established institution or organization with editorial oversight and accountability.

Rating inherited from publication venue: OpenAI

This is OpenAI's official safety documentation for GPT-5, a primary source for understanding how a leading AI lab evaluates and communicates frontier model risks at deployment; directly relevant to discussions of AI safety standards, evaluation methodology, and responsible release practices.

Metadata

Importance: 78/100organizational reportprimary source

Summary

OpenAI's official system card for GPT-5 documenting the model's safety evaluations, capabilities assessments, and risk mitigations prior to deployment. It covers red-teaming results, alignment measures, and identified limitations across domains including cybersecurity, CBRN risks, and persuasion. The card represents OpenAI's formal safety disclosure process for a frontier model release.

Key Points

  • Documents pre-deployment safety evaluations including red-teaming across high-risk domains such as cybersecurity, CBRN, and persuasion/influence operations
  • Assesses GPT-5's capabilities and uplift potential in dangerous areas, providing risk ratings and mitigation measures
  • Details alignment and RLHF-based safety techniques used to reduce harmful outputs and improve instruction-following safety
  • Outlines OpenAI's Preparedness Framework assessments and how GPT-5 scored across tracked risk categories
  • Represents a key artifact in the evolving practice of frontier AI transparency and responsible deployment disclosure

Cited by 1 page

PageTypeQuality
Large Language ModelsCapability60.0

Cached Content Preview

HTTP 200Fetched Mar 20, 20264 KB
GPT-5 System Card \| OpenAI

August 7, 2025

[Publication](https://openai.com/research/index/publication/) [Safety](https://openai.com/news/safety-alignment/)

# GPT‑5 System Card

[Read the System Card(opens in a new window)](https://arxiv.org/abs/2601.03267) [Dive into the data(opens in a new window)](https://deploymentsafety.openai.com/gpt-5)

Share

GPT‑5 is a unified system with a smart and fast model that answers most questions, a deeper reasoning model for harder problems, and a real-time router that quickly decides which model to use based on conversation type, complexity, tool needs, and explicit intent (for example, if you say “think hard about this” in the prompt). The router is continuously trained on real signals, including when users switch models, preference rates for responses, and measured correctness, improving over time. Once usage limits are reached, a mini version of each model handles remaining queries. In the near future, we plan to integrate these capabilities into a single model.

In this system card, we label the fast, high-throughput models as gpt-5-main and gpt-5-main-mini, and the thinking models as gpt-5-thinking and gpt-5-thinking-mini. In the API, we provide direct access to the thinking model, its mini version, and an even smaller and faster nano version of the thinking model, made for developers (gpt-5-thinking-nano). In ChatGPT, we also provide access to gpt-5-thinking using a setting that makes use of parallel test time compute; we refer to this as gpt-5-thinking-pro.

It can be helpful to think of the GPT‑5 models as successors to previous models:

|     |     |
| --- | --- |
| **Previous model** | **GPT-5 model** |
| GPT-4o | gpt-5-main |
| GPT-4o-mini | gpt-5-main-mini |
| OpenAI o3 | gpt-5-thinking |
| OpenAI o4-mini | gpt-5-thinking-mini |
| GPT-4.1-nano | gpt-5-thinking-nano |
| OpenAI o3 Pro | gpt-5-thinking-pro |

This system card focuses primarily on gpt-5-thinking and gpt-5-main, while evaluations for other models are available in the appendix. The GPT‑5 system not only outperforms previous models on benchmarks and answers questions more quickly, but—more importantly—is more useful for real-world queries. We’ve made significant advances in reducing hallucinations, improving instruction following, and minimizing sycophancy, and have leveled up GPT‑5’s performance in three of ChatGPT’s most common uses: writing, coding, and health. All of the GPT‑5 models additionally feature safe-completions, our latest approach to safety training to prevent disallowed content.

Similarly to ChatGPT agent, we have decided to treat gpt-5-thinking as High capability in the Biological and Chemical domain under our [Preparedness Framework](https://openai.com/index/updating-our-preparedness-framework/), activating the associated safeguards. While we do not have definitive evidence that this model could meaningfully help a novice to create severe biological harm—our [defined threshold⁠(opens in a new window)](https://cdn.ope

... (truncated, 4 KB total)
Resource ID: 817c3cbf13144f20 | Stable ID: YjcxY2UwYj