Anthropic launches Claude Sonnet 5, a mid-tier AI model built for affordable autonomous agents

Reviewed byNidhi Govil

9 Sources

Share

Anthropic unveiled Claude Sonnet 5, its most agentic mid-tier model yet, designed to handle autonomous tasks like planning, coding, and browser control at a fraction of flagship costs. Starting at $2 per million input tokens, the model approaches Opus 4.8 performance while addressing enterprise concerns about ballooning AI bills. The release comes as Anthropic navigates regulatory scrutiny and races toward a potential IPO.

Anthropic releases Claude Sonnet 5 with near-flagship agentic capabilities

Anthropic launched Claude Sonnet 5 on June 30, 2026, positioning it as the company's most capable mid-tier AI model to date

1

. The new model delivers performance approaching the flagship Opus 4.8 across reasoning, coding, and planning and tool use tasks, but at less than half the cost

1

. Claude Sonnet 5 is now the default model for Claude's Free and Pro tiers, with availability extending to Max, Team, and Enterprise subscribers

2

. The model can make plans, drive browsers and terminals, and execute autonomous tasks that required larger, more expensive models just months ago

1

.

Source: 9to5Mac

Source: 9to5Mac

API pricing starts at an introductory rate of $2 per million input tokens and $10 per million output tokens through August 31, 2026, before rising to $3 and $15 respectively

3

. By comparison, Opus 4.8 costs $5 per million input tokens and $25 per million output tokens

5

. This aggressive pricing strategy directly addresses a pain point that has emerged as companies deploy AI agents across their operations: token consumption burns through budgets fast when agents loop, call tools, and run autonomously for extended periods

1

.

Performance benchmarks show Sonnet 5 closing the gap with Opus 4.8

On SWE-bench Pro, an agentic coding benchmark, Claude Sonnet 5 scored 63.2 percent compared with 69.2 percent for Opus 4.8 and 58.1 percent for its predecessor Sonnet 4.6

5

. On Terminal-Bench 2.1, another coding evaluation, Sonnet 5 reached 80.4 percent versus 67.0 percent for Sonnet 4.6 and 82.7 percent for Opus 4.8

5

. In multidisciplinary reasoning measured by Humanity's Last Exam, Sonnet 5 scored 57.4 percent with tools, essentially matching Opus 4.8's 57.9 percent

5

. On the knowledge-work benchmark GDPval-AA v2, it scored 1,618, surpassing Opus 4.8's 1,615

5

.

Source: Mashable

Source: Mashable

Early access partners reported that the model completes complex jobs where older Sonnets gave up . Sualeh Asif, co-founder of Cursor, noted that "with Claude Sonnet 5, agents stay on plan, follow our conventions, and ship clean multi-step changes, all at an efficient cost"

5

. Daniel Shepard, a senior engineer at Zapier, described handing the model a two-part automation job that "used to stall halfway" with previous models but now completes end to end

5

. Anthropic also introduced an "effort" dial, allowing developers to trade cost for accuracy between Sonnet 5 and Opus models

1

.

Reduced cybersecurity risks address regulatory concerns

Anthropic emphasized that Claude Sonnet 5 shows "substantially poorer performance" on cybersecurity-related tasks compared to Opus 4.8 and Mythos 5

2

. The company stated it did not deliberately train Sonnet 5 on cybersecurity tasks, and the model has a "much lower ability" to perform dangerous cyber activities than current Opus models

4

. In a test with Mozilla on the Firefox browser, the model never produced a working exploit

1

. Even so, Anthropic shipped it with real-time cyber safeguards enabled by default

1

.

Source: Gizmodo

Source: Gizmodo

This positioning matters because Anthropic remains in ongoing discussions with the Trump administration over model releases, discussions that include Sonnet 5

4

. The company's more powerful Mythos 5 and Fable 5 models remain under regulatory scrutiny after the government abruptly asked Anthropic to take them down over security concerns

4

. Mythos 5 is now available on a limited basis, and Fable 5 is on track to return soon

4

. The administration also asked OpenAI to stagger the release of its most powerful class of models, GPT-5.6

4

.

Cost-effective AI strategy targets enterprise AI adoption and IPO readiness

The release represents a clear strategic bet: make cost-effective AI powerful enough to handle production workloads while building the broad-based developer adoption that will prove attractive as Anthropic races toward a blockbuster IPO

5

. Companies have been pivoting to cheaper Chinese models amid renewed focus on AI usage costs

4

. Both Anthropic and OpenAI have reportedly been considering significant price cuts to attract new users and retain current ones

2

.

One technical consideration: Sonnet 5 uses an updated tokenizer that can map the same text to roughly 1.0 to 1.35 times as many tokens depending on content type . Anthropic calibrated the introductory API pricing to make the transition roughly cost-neutral, but enterprise AI customers running high-volume workloads should benchmark their specific use cases

5

. The company also increased rate limits across Chat, Cowork, Claude Code, and the Claude Platform to accommodate higher token usage at elevated effort levels

3

.

Anthropic reports that Sonnet 5 refuses malicious requests more often and resists prompt-injection attacks better than Sonnet 4.6, while also hallucinating and flattering less

1

. The model is available now in Claude's apps, Claude Code, and the API

1

. As AI labs continue releasing models while the administration determines which to allow and which to limit, the question for developers shifts from whether models are capable enough to whether they're affordable enough to run continuously

4

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved