Back
Introducing Claude Opus 4.5
blogCredibility Rating
4/5
High(4)High quality. Established institution or organization with editorial oversight and accountability.
Rating inherited from publication venue: Anthropic
Official Anthropic announcement for Claude Opus 4.5 (November 2025); relevant for tracking frontier model capabilities, agentic AI development, and deployment practices that intersect with AI safety considerations.
Metadata
Importance: 52/100blog postnews
Summary
Anthropic announces Claude Opus 4.5, their most capable model optimized for coding, agentic tasks, and computer use, with significantly reduced pricing ($5/$25 per million tokens). The model demonstrates state-of-the-art performance on software engineering benchmarks, long-horizon autonomous tasks, and multi-step reasoning while being notably more token-efficient than predecessors.
Key Points
- •Claude Opus 4.5 achieves state-of-the-art results on real-world software engineering benchmarks and agentic workflows with improved token efficiency.
- •Pricing reduced to $5/$25 per million input/output tokens, making frontier-level capabilities more broadly accessible to developers and enterprises.
- •Demonstrates breakthrough capability in self-improving AI agents, autonomously refining capabilities in 4 iterations where competitors failed after 10.
- •Released alongside updates to Claude Developer Platform, Claude Code, and consumer apps including Excel and Chrome integrations.
- •Supports long-horizon autonomous tasks with sustained reasoning, showing 15% improvement over Sonnet 4.5 on Terminal Bench.
Cited by 4 pages
| Page | Type | Quality |
|---|---|---|
| Long-Horizon Autonomous Tasks | Capability | 65.0 |
| Reasoning and Planning | Capability | 65.0 |
| Tool Use and Computer Use | Capability | 67.0 |
| Anthropic | Organization | 74.0 |
Cached Content Preview
HTTP 200Fetched Feb 25, 2026272 KB
Announcements Introducing Claude Opus 4.5 Nov 24, 2025 Our newest model, Claude Opus 4.5, is available today. It’s intelligent, efficient, and the best model in the world for coding, agents, and computer use. It’s also meaningfully better at everyday tasks like deep research and working with slides and spreadsheets. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done. Claude Opus 4.5 is state-of-the-art on tests of real-world software engineering: Opus 4.5 is available today on our apps, our API, and on all three major cloud platforms. If you’re a developer, simply use claude-opus-4-5-20251101 via the Claude API . Pricing is now $5/$25 per million tokens—making Opus-level capabilities accessible to even more users, teams, and enterprises. Alongside Opus, we’re releasing updates to the Claude Developer Platform , Claude Code , and our consumer apps . There are new tools for longer-running agents and new ways to use Claude in Excel, Chrome, and on desktop. In the Claude apps, lengthy conversations no longer hit a wall. See our product-focused section below for details. First impressions As our Anthropic colleagues tested the model before release, we heard remarkably consistent feedback. Testers noted that Claude Opus 4.5 handles ambiguity and reasons about tradeoffs without hand-holding. They told us that, when pointed at a complex, multi-system bug, Opus 4.5 figures out the fix. They said that tasks that were near-impossible for Sonnet 4.5 just a few weeks ago are now within reach. Overall, our testers told us that Opus 4.5 just “gets it.” Many of our customers with early access have had similar experiences. Here are some examples of what they told us: Opus models have always been “the real SOTA” but have been cost prohibitive in the past. Claude Opus 4.5 is now at a price point where it can be your go-to model for most tasks. It’s the clear winner and exhibits the best frontier task planning and tool calling we’ve seen yet. Claude Opus 4.5 delivers high-quality code and excels at powering heavy-duty agentic workflows with GitHub Copilot. Early testing shows it surpasses internal coding benchmarks while cutting token usage in half , and is especially well-suited for tasks like code migration and code refactoring. Claude Opus 4.5 beats Sonnet 4.5 and competition on our internal benchmarks, using fewer tokens to solve the same problems . At scale, that efficiency compounds. Claude Opus 4.5 delivers frontier reasoning within Lovable's chat mode , where users plan and iterate on projects. Its reasoning depth transforms planning—and great planning makes code generation even better. Claude Opus 4.5 excels at long-horizon, autonomous tasks , especially those that require sustained reasoning and multi-step execution. In our evaluations it handled complex workflows with fewer dead-ends. On Terminal Bench it delivered a 15% improvement over Sonnet 4.5, a meaningful gain that becomes especially cle
... (truncated, 272 KB total)Resource ID:
57f01cae307e1cb1 | Stable ID: YzVhMzljNT