Longterm Wiki

Credibility Rating

4/5 (High)

High quality. Established institution or organization with editorial oversight and accountability.

Rating inherited from publication venue: OpenAI

Official OpenAI announcement for o3 and o4-mini models; relevant to AI safety discussions around rapidly advancing frontier model capabilities, inference-time compute scaling, and deployment of increasingly powerful agentic systems.

Metadata

Importance: 62/100 · press release · primary source

Summary

OpenAI's announcement of its o3 and o4-mini reasoning models, which it describes as the smartest models it has released to date, with advances in chain-of-thought reasoning, coding, mathematics, and agentic tasks. The models build on the 'o-series' reasoning approach and, per the announcement, improve on benchmarks including Codeforces, SWE-bench, MMMU, and AIME 2024 and 2025.

Key Points

  • o3 and o4-mini are advanced reasoning models using extended chain-of-thought processing before responding
  • Models show strong performance on math, science, coding, and visual reasoning benchmarks
  • o4-mini offers a cost-efficient alternative with competitive performance on reasoning-intensive tasks
  • Models can use and combine the tools within ChatGPT in agentic settings, including web search, Python-based file and data analysis, visual reasoning, and image generation
  • Release continues OpenAI's scaling of inference-time compute as a path to capability improvements

Cited by 5 pages

Cached Content Preview

HTTP 200 · Fetched Mar 20, 2026 · 39 KB
OpenAI

April 16, 2025

[Release](https://openai.com/research/index/release/) [Product](https://openai.com/news/product-releases/)

# Introducing OpenAI o3 and o4-mini

_**Update on June 10, 2025:** **OpenAI o3‑pro is now available to Pro users in ChatGPT, as well as in our API.** Like OpenAI o1‑pro, o3‑pro is a version of our most intelligent model, OpenAI o3, designed to think longer and provide the most reliable responses. Full details can be found in our [release notes](https://help.openai.com/en/articles/9624314-model-release-notes)._

* * *

Today, we’re releasing OpenAI **o3** and **o4-mini,** the latest in our o-series of models trained to think for longer before responding. These are the smartest models we’ve released to date, representing a step change in ChatGPT's capabilities for everyone from curious users to advanced researchers. For the first time, our reasoning models can agentically use and combine every tool within ChatGPT—this includes searching the web, analyzing uploaded files and other data with Python, reasoning deeply about visual inputs, and even generating images. Critically, these models are trained to reason about when and how to use tools to produce detailed and thoughtful answers in the right output formats, typically in under a minute, to solve more complex problems. This allows them to tackle multi-faceted questions more effectively, a step toward a more agentic ChatGPT that can independently execute tasks on your behalf. The combined power of state-of-the-art reasoning with full tool access translates into significantly stronger performance across academic benchmarks and real-world tasks, setting a new standard in both intelligence and usefulness.

## What’s changed

**OpenAI o3** is our most powerful reasoning model that pushes the frontier across **coding, math, science, visual perception**, and more. It sets a new SOTA on benchmarks including Codeforces, SWE-bench (without building a custom model-specific scaffold), and MMMU. It’s ideal for complex queries requiring multi-faceted analysis and whose answers may not be immediately obvious. It performs especially strongly at visual tasks like analyzing images, charts, and graphics. In evaluations by external experts, o3 makes 20 percent fewer major errors than OpenAI o1 on difficult, real-world tasks—especially excelling in areas like programming, business/consulting, and creative ideation. Early testers highlighted its analytical rigor as a thought partner and emphasized its ability to generate and critically evaluate novel hypotheses—particularly within biology, math, and engineering contexts.

**OpenAI o4-mini** is a smaller model optimized for fast, cost-efficient reasoning—it achieves remarkable performance for its size and cost, particularly in **math, coding, and visual tasks**. It is the best-performing benchmarked model on AIME 2024 and 2025.

... (truncated, 39 KB total)
Resource ID: bf92f3d905c3de0d | Stable ID: NjY1OTZjYm