Skip to content
Longterm Wiki

Introducing Sonnet 4.6 \ Anthropic

web

Credibility Rating

4/5
High(4)

High quality. Established institution or organization with editorial oversight and accountability.

Rating inherited from publication venue: Anthropic

Anthropic's announcement of Claude Sonnet 4.6 is relevant to AI safety as it details safety evaluations, prompt injection resistance improvements, and computer use capabilities—areas with direct implications for agentic AI safety and deployment risks.

Metadata

Importance: 42/100press releasenews

Summary

Anthropic announces Claude Sonnet 4.6, a major upgrade featuring improved coding, computer use, long-context reasoning, and a 1M token context window. The release highlights safety evaluations showing strong safety behaviors and significant improvements in prompt injection resistance. The model achieves near-Opus-level performance at Sonnet pricing.

Key Points

  • Claude Sonnet 4.6 features a 1M token context window (beta) and major improvements in coding, computer use, and agentic planning.
  • Safety evaluations found 'very strong safety behaviors and no signs of major concerns around high-stakes forms of misalignment.'
  • Prompt injection resistance is significantly improved over Sonnet 4.5, now performing similarly to Opus 4.6 on this safety metric.
  • Computer use capabilities have advanced substantially on OSWorld benchmarks, with users reporting human-level performance on some tasks.
  • Pricing remains at $3/$15 per million tokens, making Opus-class performance more accessible.

Cached Content Preview

HTTP 200Fetched Apr 30, 202614 KB
Product Introducing Claude Sonnet 4.6

 Feb 17, 2026 Claude Sonnet 4.6 is our most capable Sonnet model yet . It’s a full upgrade of the model’s skills across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Sonnet 4.6 also features a 1M token context window in beta.

 For those on our Free and Pro plans , Claude Sonnet 4.6 is now the default model in claude.ai and Claude Cowork . Pricing remains the same as Sonnet 4.5, starting at $3/$15 per million tokens.

 Sonnet 4.6 brings much-improved coding skills to more of our users. Improvements in consistency, instruction following, and more have made developers with early access prefer Sonnet 4.6 to its predecessor by a wide margin. They often even prefer it to our smartest model from November 2025, Claude Opus 4.5.

 Performance that would have previously required reaching for an Opus-class model—including on real-world, economically valuable office tasks —is now available with Sonnet 4.6. The model also shows a major improvement in computer use skills compared to prior Sonnet models.

 As with every new Claude model, we’ve run extensive safety evaluations of Sonnet 4.6, which overall showed it to be as safe as, or safer than, our other recent Claude models. Our safety researchers concluded that Sonnet 4.6 has “a broadly warm, honest, prosocial, and at times funny character, very strong safety behaviors, and no signs of major concerns around high-stakes forms of misalignment.”

 Computer use

 Almost every organization has software it can’t easily automate: specialized systems and tools built before modern interfaces like APIs existed. To have AI use such software, users would previously have had to build bespoke connectors. But a model that can use a computer the way a person does changes that equation.

 In October 2024, we were the first to introduce a general-purpose computer-using model. At the time, we wrote that it was “still experimental—at times cumbersome and error-prone,” but we expected rapid improvement. OSWorld , the standard benchmark for AI computer use, shows how far our models have come. It presents hundreds of tasks across real software (Chrome, LibreOffice, VS Code, and more) running on a simulated computer. There are no special APIs or purpose-built connectors; the model sees the computer and interacts with it in much the same way a person would: clicking a (virtual) mouse and typing on a (virtual) keyboard.

 Across sixteen months, our Sonnet models have made steady gains on OSWorld. The improvements can also be seen beyond benchmarks: early Sonnet 4.6 users are seeing human-level capability in tasks like navigating a complex spreadsheet or filling out a multi-step web form, before pulling it all together across multiple browser tabs.

 The model certainly still lags behind the most skilled humans at using computers. But the rate of progress is remarkable nonetheless. It means that computer use is much more useful for a range of work tas

... (truncated, 14 KB total)
Resource ID: c4a45eed67ee9087 | Stable ID: sid_BgNd6TAGCQ