Anthropic launches Claude Opus 4.8 as Mythos-class AI model nears public release

Reviewed byNidhi Govil

28 Sources

Share

Anthropic released Claude Opus 4.8 with significant improvements in honesty and coding capabilities, just over a month after version 4.7. The AI model is 4x less likely to make unsupported claims and shows notable gains in agentic coding. Meanwhile, the company plans to release its powerful Mythos-class AI model to all customers within weeks after restricting access through Project Glasswing.

Anthropic Releases Claude Opus 4.8 With Enhanced Honesty

Anthropic launched Claude Opus 4.8 on Thursday, delivering what the company describes as a "modest but tangible improvement" over its predecessor just over a month after releasing version 4.7

1

. The rapid iteration demonstrates how quickly the AI model landscape evolves, with the new large language model posting impressive benchmark gains particularly in agentic coding capabilities and what Anthropic calls "honesty in AI responses"

4

.

The most prominent improvement centers on reducing hallucination and unsupported claims. According to Anthropic, early testers found that Claude Opus 4.8 "is more likely to flag uncertainties about its work and less likely to make unsupported claims"

2

. The company's evaluations show the model is around 4x less likely than its predecessor to allow flaws in code it has written to pass unremarked

3

. This addresses a persistent problem where AI models sometimes jump to conclusions, confidently presenting their work as making progress despite thin evidence

5

.

Source: Decrypt

Source: Decrypt

Benchmark Improvements Show Significant Gains in Agentic Coding Capabilities

Beyond honesty improvements, Claude Opus 4.8 demonstrates substantial performance gains across multiple benchmarks. The model shows almost a 5 percentage point increase in agentic coding and over an 8 percentage point increase in agentic terminal coding compared to version 4.7

4

. Tom Pritchard, a staff engineer at Spotify who tested the model, noted that "Claude Opus 4.8 has noticeably better judgment. In Claude Code, it asks the right questions, catches its own mistakes, pushes back when a plan isn't sound, and builds up confidence around complex, multi-service explorations before making big changes"

3

.

The model also introduces new features that give users more control over computational resources. Users can now direct the amount of effort Claude puts into a task through Claude.ai and Claude Cowork. Higher-effort responses will use more tokens and potentially deliver better results, while lower settings respond faster and hit rate limits more slowly

1

. In coding tasks, the default high effort setting spends a similar number of tokens as Claude Code Opus 4.7's default level but with better performance

3

.

Source: NYT

Source: NYT

Dynamic Workflows Enable Hundreds of Parallel Subagents

Anthropic is launching a feature called dynamic workflows in research preview, designed for large-scale tasks such as codebase migrations across hundreds of thousands of lines of code. With dynamic workflows, "Claude can plan the work and then run hundreds of parallel subagents in a single session (and with Opus 4.8, the agents can run for even longer). It then verifies its outputs before reporting back to the user"

2

. Rather than following a fixed plan, agents can adjust their priorities and tasks based on what they discover during their work

3

. The debugging capabilities built into this feature connect directly to the model's improved honesty, as verifying results from hundreds of subagents requires reliable detection of uncertainty and failed outputs. Dynamic workflows will be available to Claude Code users on Enterprise, Team, and Max plans

3

.

Source: Gizmodo

Source: Gizmodo

Mythos-Class AI Model Coming to All Customers Within Weeks

While releasing Claude Opus 4.8, Anthropic announced it is making "swift progress" on developing safeguards for its Claude Mythos Preview model and expects to bring Mythos-class AI model capabilities to all customers "in the coming weeks"

1

. The company has restricted access to Mythos through Project Glasswing, limiting it to a consortium of partners including major tech firms such as Amazon, Microsoft, and Apple for cybersecurity purposes

5

.

The restricted release stems from the model's enhanced cybersecurity capabilities, which Anthropic deemed advanced enough to warrant giving cybersecurity experts and major tech companies lead time to patch flaws found by the model. Security researchers have confirmed the model can find exploits much more quickly than human hackers, with Mozilla's latest Firefox version including more than 200 fixes identified by Mythos Preview

1

. However, the model's cost may limit its appeal to threat actors. Jake Williams, a cybersecurity researcher at IANS Research, found Mythos was 30 times as expensive in tests as the previous Opus model, placing it "outside the reach of many, including any commodity threat actors"

1

.

Darren Williams, founder and CEO of cybersecurity firm BlackFog, noted that "the window between a powerful model's release and broad adoption of defenses against it is always a vulnerable moment," highlighting the tension inherent in releasing such capable models even with safeguards

1

. The upcoming public release will reveal whether Mythos lives up to expectations and how effective Anthropic's cybersecurity guardrails prove in practice.🟡 compliments to the story and in right order.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved