32 Sources
[1]
Anthropic's New Claude Opus 4.5 AI Model Is Designed for Coding and Office Work
Anthropic's newest version of its most powerful generative AI model could upend how you manage your spreadsheets. The company said Claude Opus 4.5, announced Monday, is aimed at things you do on the job, like coding and office work. Google unveiled its powerful new Gemini 3 model last week, and OpenAI released GPT-5.1 the week before. Now it's Anthropic's turn. The company, which is popular with businesses and software workers, said Opus 4.5 is focused on getting work done, not generating content. Claude Opus 4.5 will be available everywhere and will be a default model for Pro (starting at $17/month), Max (starting at $100/month) and Enterprise users. Opus 4.5 is built to produce documents, spreadsheets and presentations and can automate menial office tasks by using your computer and browser. That includes its deployment in Claude for Chrome, a browser extension that lets Claude do internet tasks for Max users. This release puts all three Claude models in the 4.5 generation. Anthropic released Sonnet 4.5, its midlevel model, in September and Haiku 4.5, its smallest model, in October. Advanced reasoning models like Opus are designed to handle complex, demanding tasks. While a smaller, cheaper large language model will provide an answer based on the probabilities in its training data, a reasoning model will rerun and refine its operations to get a better or more complete answer. This takes longer, but it means the AI can handle more difficult operations. Reasoning models are particularly useful for complicated programming projects or intensive research. The downside is they are slower and more expensive to run, which is why companies often restrict them to paid plans or have strict limits on usage.
[2]
Anthropic's new model is its latest frontier in the AI agent battle -- but it's still facing cybersecurity concerns
But the model is still too new to have made waves on LMArena yet, a popular crowdsourced AI model evaluation platform. And it's still facing the same cybersecurity issues that plague most agentic AI tools. The company's blog post also says Opus 4.5 is significantly better than its predecessor at deep research, working with slides, and filling out spreadsheets. Additionally, Anthropic is also releasing new tools within Claude Code, its coding tool, and its consumer-facing Claude apps, which it says will help with "longer-running agents and new ways to use Claude in Excel, Chrome, and on desktop." Claude Opus 4.5 is available today via Anthropic's apps, API, and all three major cloud providers, per Anthropic.
[3]
Anthropic's Claude Opus 4.5 pricing cut signals a shift in the enterprise AI market
Two-thirds price reduction and enhanced capabilities target software development and compliance teams as competition intensifies. Anthropic has launched Claude Opus 4.5 with a 67% price cut that repositions its flagship model from a boutique offering to a production-ready enterprise tool. The new pricing of $5 per million input tokens and $25 per million output tokens -- down from $15 and $75 -- brings Anthropic closer to OpenAI and Google while maintaining a premium position. The launch comes a week after Google released Gemini 3 and less than two weeks after OpenAI launched GPT-5.1, underscoring the rapid pace of competition in enterprise AI. For context, OpenAI's GPT-5.1 costs $1.25 per million input tokens and $10 per million output tokens, while Google's Gemini 3 Pro runs $2 to $4 per million input tokens.
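The "67%" and "two-thirds" figures follow directly from the per-token rates quoted above. The short sketch below reproduces that arithmetic; the workload size (2 million input tokens, 500,000 output tokens) is a made-up example for illustration, not a figure from the article.

```python
def percent_cut(old: float, new: float) -> float:
    """Percentage reduction when a price drops from `old` to `new`."""
    return (old - new) / old * 100


def workload_cost(input_tokens: int, output_tokens: int,
                  input_rate: float, output_rate: float) -> float:
    """Cost in USD, given rates expressed in USD per million tokens."""
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# Rates quoted in the article: (input, output) in USD per million tokens.
OPUS_4_1 = (15.0, 75.0)
OPUS_4_5 = (5.0, 25.0)

input_cut = percent_cut(OPUS_4_1[0], OPUS_4_5[0])
output_cut = percent_cut(OPUS_4_1[1], OPUS_4_5[1])

# Hypothetical workload, chosen only to make the difference concrete.
old_cost = workload_cost(2_000_000, 500_000, *OPUS_4_1)
new_cost = workload_cost(2_000_000, 500_000, *OPUS_4_5)

print(f"input cut: {input_cut:.1f}%, output cut: {output_cut:.1f}%")
print(f"old cost: ${old_cost:.2f}, new cost: ${new_cost:.2f}")
```

Both rates fall by the same ratio, so the reduction is about 66.7% regardless of how a workload splits between input and output tokens.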
[4]
Anthropic unveils Claude Opus 4.5, its latest AI model following $350 billion valuation
Anthropic on Monday announced Claude Opus 4.5, its latest artificial intelligence model that the startup says excels at coding, using computers and assisting users with complex enterprise tasks. Claude Opus 4.5 marks Anthropic's third major model launch in two months, and it serves as the latest example of the nonstop pace of development within the AI industry. The startup unveiled its Claude Sonnet 4.5 model in late September, followed by its Claude Haiku 4.5 model in October. "The amount that we're releasing to the market and the feedback loops that we're generating from it just make me so unbelievably excited," Scott White, product leader for Claude.ai at Anthropic, told CNBC in an interview. Anthropic is an AI startup that was founded by a group of former OpenAI researchers and executives in 2021. Microsoft and Nvidia announced multi-billion-dollar investments in Anthropic last week, boosting the AI lab's valuation to about $350 billion. The company is best known for developing a family of AI models called Claude. It assigns new numbers to the models as they advance across generations, but the largest model in the family is typically called Opus, the midsized model is called Sonnet and the smallest model is Haiku.
[5]
Anthropic releases Claude Opus 4.5
Updated AI model for coding, agents, and computer use is meaningfully better at everyday tasks like deep research and working with slides and spreadsheets, Anthropic said. Anthropic has introduced Claude Opus 4.5, a hybrid reasoning model for coding, agents, and computer use. The company said this new version of the Claude Opus model offers better vision, reasoning, and mathematics than predecessors. Claude Opus 4.5 is meaningfully better at everyday tasks such as deep research and working with slides and spreadsheets, Anthropic said. Claude Opus 4.5 was introduced November 24. It is available now in Anthropic's consumer applications, its API, and on all three major cloud platforms including Amazon Web Services, Google Cloud Platform, and Microsoft Azure. Developers can access the model via the Claude API.
[6]
Anthropic releases Claude Opus 4.5 with Chrome and Excel integration
The model is the first to reach over 80 per cent on SWE-Bench Verified, which is used to measure programming skills. Anthropic has now launched Claude Opus 4.5, the latest version of the company's flagship AI. Techcrunch writes that Opus 4.5 should perform superbly in several benchmark tests, such as SWE-Bench (coding), tau2-bench (tool usage) and GPQA Diamond (problem solving). It is the first model to score over 80 per cent on SWE-Bench Verified, an important benchmark of a model's programming ability. New features include Claude for Excel, a sidebar in the programme now available to Max, Team and Enterprise users. It supports pivot tables, charts and file uploads. At the same time, Claude for Chrome will also be available to all Max users. Another new feature is improved memory management. Users can now talk to Claude without interruption when the memory limit is reached, by the model itself compressing older parts of the conversation in the background. Opus 4.5 is also optimised for so-called agentic use cases, where it can act as the main agent and control smaller Haiku-powered sub-agents. According to Anthropic, Opus 4.5 is also their most secure model yet with better protection against prompt injection attacks.
[7]
Anthropic reveals new Opus 4.5 model, brings Claude Code to the Mac app - 9to5Mac
Anthropic has announced its latest AI model with Claude Opus 4.5. The company has also expanded Claude Code availability to the Claude desktop app for the first time. Anthropic describes its new Opus 4.5 model as "intelligent, efficient, and the best model in the world for coding, agents, and computer use." It follows Opus 4.1, which Anthropic released in August. The company shares internal first impressions of new model: As our Anthropic colleagues tested the model before release, we heard remarkably consistent feedback. Testers noted that Claude Opus 4.5 handles ambiguity and reasons about tradeoffs without hand-holding. They told us that, when pointed at a complex, multi-system bug, Opus 4.5 figures out the fix. They said that tasks that were near-impossible for Sonnet 4.5 just a few weeks ago are now within reach. Overall, our testers told us that Opus 4.5 just "gets it." Claude Opus 4.5 is also said to be more efficient, requiring fewer tokens for similar tasks, making it more affordable. Additionally, Claude Code has been added to Anthropic's desktop apps including the Mac version. Claude Code was previously limited to mobile apps and the web. It allows software engineers to code, research, and update work with multiple local and remote sessions running at the same time. Anthropic also says that Claude app users will no longer hit a wall with long conversations as frequently. With today's updates, Claude can automatically summarize earlier parts of a conversation to allow more room for continuing the chat without hitting limits.
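The automatic summarization described above can be pictured as a simple compaction step: once a conversation exceeds some budget, older messages are replaced by a summary while the most recent ones are kept verbatim. The sketch below is only an illustration of that general idea, not Anthropic's implementation; `count_tokens`, `naive_summarize`, the whitespace-based token count, and the `keep_recent` parameter are all hypothetical stand-ins.

```python
from typing import Callable, List


def count_tokens(text: str) -> int:
    """Crude stand-in for a tokenizer: counts whitespace-separated words."""
    return len(text.split())


def compact(messages: List[str], budget: int,
            summarize: Callable[[List[str]], str],
            keep_recent: int = 2) -> List[str]:
    """Replace older messages with one summary when the history exceeds `budget`."""
    total = sum(count_tokens(m) for m in messages)
    if total <= budget or len(messages) <= keep_recent:
        return messages  # nothing to compact
    cut = len(messages) - keep_recent
    older, recent = messages[:cut], messages[cut:]
    return [summarize(older)] + recent


def naive_summarize(older: List[str]) -> str:
    """Placeholder summarizer; a real system would call a model here."""
    return f"[summary of {len(older)} earlier messages]"


history = ["one two three", "four five six", "seven eight", "nine ten"]
compacted = compact(history, budget=5, summarize=naive_summarize)
print(compacted)
```

In this toy run the first two messages collapse into a single summary line and the last two are preserved, which frees room to continue the chat.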
[8]
Anthropic Launches Claude Opus 4.5 With Improved Coding and Agent Capabilities
Anthropic today announced the launch of Claude Opus 4.5, which it says is the "best model in the world for coding, agents, and computer use." It's improved over prior models for everyday tasks like deep research, and it is a "step forward in what AI systems can do." According to feedback Anthropic received from early testers, Claude Opus 4.5 can complete tasks that were impossible for Sonnet 4.5, and that it is able to handle ambiguity and reason about tradeoffs without hand-holding. The Opus 4.5 model offers better vision, reasoning, mathematics skills, and coding than prior versions of Claude. Along with the Opus update, Anthropic is updating its apps, the Claude Developer Platform, and Claude Code. There are tools for longer-running agents, and options to use Claude in Excel, Chrome, and on the desktop. In the Claude apps, users will no longer run into limits during a long conversation. Claude is able to automatically summarize earlier context, which means the conversation can keep going endlessly. Claude for Chrome is available to all Max users, and Claude for Excel beta access is now available to all Max, Team, and Enterprise users. Claude Code is now available in the desktop app, and with Opus 4.5, it is able to build more precise plans and execute them more thoroughly. Claude is able to ask clarifying questions upfront and then build a user-editable plan before executing. Claude Opus 4.5 is available today across Anthropic's apps and its API. Opus-specific caps have been removed for Claude and Claude Code users with access to Opus 4.5, and for Max and Team Premium members, overall usage limits have increased.
[9]
Claude Opus 4.5 is here -- Anthropic's most powerful version to date
Hot on the heels of Gemini 3, Anthropic has just released a new AI model. Claude Opus 4.5 is the newest, most powerful version in the Claude line-up, boosting its thinking and coding abilities drastically. Anthropic offers three different versions of its Claude AI system. Haiku is the fastest and most cost-effective, Sonnet sits in the middle, blending ability and efficiency, and Opus is its most intelligent and capable model, designed for the most challenging tasks. Both Haiku and Sonnet received updates in recent months, so an Opus update was widely expected. So what is new and how does it match up to its competitors? Claude Opus 4.5 -- what's new? Unlike some of its competitors, Anthropic isn't focused on the bells and whistles. That means there are no image or video generators, and no clever add-ons like ChatGPT's new group chats. Instead, Opus 4.5 is focused on developing an effective tool for coding, office tasks and agentic modes, without sacrificing safety in the process. It can produce documents, spreadsheets and presentations with consistency and polish, while also performing repetitive tasks that you would rather not do, using agentic modes, a feature that is seeing major development across all of the big AI chatbots. This version of Claude is mostly focused on two main roles -- coding and workplace activities. With this, Anthropic has claimed that Opus 4.5 offers state-of-the-art technology for code production, as well as new ways to use Claude in Excel, Chrome and on desktop. For all Max users on Claude, you will now be able to use Claude for Chrome, letting the AI tool take control of your Chrome browser and complete tasks on your behalf. This is similar to the likes of ChatGPT Agent, Perplexity's Comet agent or Gemini 3. Anthropic has also announced that it is dealing with one of the biggest complaints it had from users, eliminating context window limit errors. This is due to a new feature called 'Infinite Chats'. 
This will leverage memory to maintain context and consistency across different files. However, this is exclusively available to those on paid plans.
What can you use Opus 4.5 for?
At its core, Opus 4.5 is simply an upgraded version of its predecessor. That means improvements to its reasoning, context, speed and general performance. However, there are a couple of areas where the upgrade will be most notable.
Agents
Claude Opus 4.5 is Anthropic's biggest push into the world of agentic AI to date. This involves the AI tool taking on more tasks on your behalf, utilizing Chrome to complete searches, payments, reservations and more.
Coding
For Anthropic, one of the areas it's seen serious progress on with previous updates is coding. Claude Opus 4.5 becomes its best coding model to date. For Anthropic, they want this to work as well as a senior engineer would, no longer requiring hand-holding or assistance from the user. This, in theory, could include fixing its own bugs and developing structurally sound code, even on more complicated tasks.
Complex tasks on Enterprise plans
As mentioned above, Anthropic is looking to become the go-to AI system for companies. With its Enterprise plan, where you can securely connect Claude to your company knowledge, Anthropic claims Opus 4.5 is better than ever. In testing, Anthropic claims that it achieves state-of-the-art results in this area, combining information retrieval, tool use and deep analysis.
Financial modeling
Anthropic also claims that Claude Opus 4.5 sets a new standard for Excel automation. Early customers saw 20% accuracy improvements on internal evaluations, 15% efficiency gains, and complex tasks that were previously deemed unachievable.
Who is this update for?
Based on the improvements made, Claude Opus 4.5 is for the serious power users of AI. It sees big changes for developers, businesses and those who are primarily focused on coding and work. 
Unlike OpenAI, Anthropic has made a big push for the world of businesses, looking to solve complex reasoning problems and fixing large-scale issues within a company's infrastructure. However, for those who like to make the most of powerful AI in their personal lives, this update shouldn't be discounted. This will become Anthropic's most powerful model to date, especially in its ability to code, and take on big thinking projects.
[10]
Claude Opus 4.5 arrives. Here's what's new.
The latest AI model from Anthropic is here. AI company Anthropic released its latest flagship AI model, Claude Opus 4.5, this week. Opus is considered one of the best AI models out there for developers looking to ramp up their coding output or to create AI agents. So, what does the latest model, Opus 4.5, bring to the table? As expected, Opus 4.5 performs exceptionally well on the various AI model benchmark tests. As TechCrunch points out, Opus 4.5 is the first model to score over 80 percent on SWE-Bench Verified, which the outlet calls a "respected coding benchmark." It seems like Opus 4.5 likely becomes the preferred AI model for vibe coders as well as experienced programmers with the way it performed on these tests. And Anthropic is further driving that by bringing additional upgrades to its Claude Code product. According to Anthropic, along with Opus 4.5, Claude Code will receive two more upgrades. Anthropic says that Claude Code's Plan Mode "now builds more precise plans and executes more thoroughly" and is now available in its desktop app, which allows users to run multiple sessions at once. Anthropic provides an example of having one AI agent fix a bug while another one researches GitHub. In addition, Anthropic is more broadly rolling out two other products, Claude for Chrome and Claude for Excel, to more subscribers. Opus 4.5 will be able to showcase its multitasking prowess in these products as users can provide AI agents with parallel tasks in their browser or in their spreadsheets. Opus 4.5 also brings some memory upgrades to the AI model which will now give users the ability to chat without limits. Claude will now summarize earlier conversations and will allow users to continue a single chat thread for as long as they'd like.
[11]
Claude Opus 4.5 is now live and "meaningfully better" at everyday tasks and coding challenges
Anthropic says it reduces cost and increases reliability for both everyday users and enterprise workflows Anthropic is making a big promise about the newest iteration of the Claude family of AI models. The company says the new Claude Opus 4.5 is "meaningfully better" than what came before, which is interesting considering the gloomy tone taken by Anthropic's CEO when discussing the future of AI. The latest upgrade to the company's flagship artificial intelligence engine is already live for users of Claude Pro and enterprise customers, and it's not shy about its ambitions. It's designed to reason more sharply, complete tasks more efficiently, and perform reliably across the kind of real-world to-do lists people actually bring to AI. And it's supposed to blow the competitors out of the water when it comes to coding, too. Claude Opus 4.5 follows the release of the mid-sized Claude Sonnet and lightweight Claude Haiku 4.5. According to Scott White, who leads product for Claude.ai, the team is "unbelievably excited" by the results and the speed of iteration. That excitement is now embedded in a model that Anthropic says can code faster, solve harder reasoning problems, and manage multi-step workflows with better consistency, and with less computing power. Claude Opus 4.5 isn't aimed at winning a Turing Test dinner party. It wants to be the one that quietly makes your job easier or at least shows off its puzzle-solving skills. For everyday users, the most immediate difference may be how little friction the model creates when given practical tasks. Claude 4.5 is supposed to carry out your prompts regardless of how messy they start out. Ask it to turn an outline into a formatted slide deck, and it should be fine. What separates Opus 4.5 from earlier versions, and from rival models from the likes of OpenAI and Google, is Anthropic's focus on usability at scale. 
Behind the scenes, it's been refining Claude to handle longer context, denser prompts, and chained tasks more effectively without scaling up the price and time required too. That combination could make it especially appealing. The real hook, though, may be how this version of Claude handles multimodal tasks. While not fully multimodal in the sense of processing video or audio inputs, Opus 4.5 is better at producing visual outputs like charts and tables and understanding complex formatting requests. A more subtle, but arguably even more important upgrade, is Claude's ability to interact with other apps and services. Anthropic notes that 4.5 performs better when it needs to act like an agent and call on other tools, moving through instructions step-by-step, and holding context across complex chains of thought. To be clear, no AI model gets everything right. Even Claude 4.5 still has blind spots and occasional hiccups. But the promise here is progress that you can feel in your day-to-day habits. It's the difference between finishing your work with the model's help and spending more time fixing what it tried to do. The speed of development is striking. Claude 4.0 debuted only months ago with glowing reviews. Now 4.5 is here, and 5.0 is likely not far behind. That kind of cycle may feel overwhelming, but it also signals a lot of rapid improvement on the technical front and a maturing market where upgrades aren't just about new tricks. If Claude Opus 4.5 lives up to the hype, it won't need to win over users with flashy tricks. It'll win by doing the work well, every time, with just enough polish that you stop noticing it's AI at all. For a model that's supposed to be "meaningfully better," that may be the most meaningful result of all.
[12]
Anthropic's Claude Opus 4.5 is here: cheaper AI, infinite chats, and coding skills that beat humans
Anthropic released its most capable artificial intelligence model yet on Monday, slashing prices by roughly two-thirds while claiming state-of-the-art performance on software engineering tasks -- a strategic move that intensifies the AI startup's competition with deep-pocketed rivals OpenAI and Google. The new model, Claude Opus 4.5, scored higher on Anthropic's most challenging internal engineering assessment than any human job candidate in the company's history, according to materials reviewed by VentureBeat. The result underscores both the rapidly advancing capabilities of AI systems and growing questions about how the technology will reshape white-collar professions. The Amazon-backed company is pricing Claude Opus 4.5 at $5 per million input tokens and $25 per million output tokens -- a dramatic reduction from the $15 and $75 rates for its predecessor, Claude Opus 4.1, released earlier this year. The move makes frontier AI capabilities accessible to a broader swath of developers and enterprises while putting pressure on competitors to match both performance and pricing. "We want to make sure this really works for people who want to work with these models," said Alex Albert, Anthropic's head of developer relations, in an exclusive interview with VentureBeat. "That is really our focus: how can we enable Claude to be better at helping you do the things that you don't necessarily want to do in your job?" The announcement comes as Anthropic races to maintain its position in an increasingly crowded field. OpenAI recently released GPT-5.1 and a specialized coding model called Codex Max that can work autonomously for extended periods. Google unveiled Gemini 3 just last week, prompting concerns even from OpenAI about the search giant's progress, according to a recent report from The Information. 
Claude Opus 4.5 demonstrates improved judgment on real-world tasks, developers say
Anthropic's internal testing revealed what the company describes as a qualitative leap in Claude Opus 4.5's reasoning capabilities. The model achieved 80.9% accuracy on SWE-bench Verified, a benchmark measuring real-world software engineering tasks, outperforming Anthropic's own Sonnet 4.5 (77.2%) and Google's Gemini 3 Pro (76.2%), according to the company's data. But the technical benchmarks tell only part of the story. Albert said employee testers consistently reported that the model demonstrates improved judgment and intuition across diverse tasks -- a shift he described as the model developing a sense of what matters in real-world contexts. "The model just kind of gets it," Albert said. "It just has developed this sort of intuition and judgment on a lot of real world things that feels qualitatively like a big jump up from past models." He pointed to his own workflow as an example. Previously, Albert said, he would ask AI models to gather information but hesitated to trust their synthesis or prioritization. With Opus 4.5, he's delegating more complete tasks, connecting it to Slack and internal documents to produce coherent summaries that match his priorities.
AI model outscores all human candidates on company's toughest engineering test
The model's performance on Anthropic's internal engineering assessment marks a notable milestone. The take-home exam, designed for prospective performance engineering candidates, is meant to evaluate technical ability and judgment under time pressure within a prescribed two-hour limit. Using a technique called parallel test-time compute -- which aggregates multiple attempts from the model and selects the best result -- Claude Opus 4.5 scored higher than any human candidate who has taken the test, according to the press release. 
Without a time limit, the model matched the performance of the best-ever human candidate when used within Claude Code, Anthropic's coding environment. The company acknowledged that the test doesn't measure other crucial professional skills such as collaboration, communication, or the instincts that develop over years of experience. Still, Anthropic said the result "raises questions about how AI will change engineering as a profession." Albert emphasized the significance of the finding. "I think this is kind of a sign, maybe, of what's to come around how useful these models can actually be in a work context and for our jobs," he said. "Of course, this was an engineering task, and I would say models are relatively ahead in engineering compared to other fields, but I think it's a really important signal to pay attention to."
Dramatic efficiency improvements cut token usage by up to 76% on key benchmarks
Beyond raw performance, Anthropic is betting that efficiency improvements will differentiate Claude Opus 4.5 in the market. The company says the model uses dramatically fewer tokens -- the units of text that AI systems process -- to achieve similar or better outcomes compared to predecessors. At a medium effort level, Opus 4.5 matches the previous Sonnet 4.5 model's best score on SWE-bench Verified while using 76% fewer output tokens, according to Anthropic. At the highest effort level, Opus 4.5 exceeds Sonnet 4.5 performance by 4.3 percentage points while still using 48% fewer tokens. To give developers more control, Anthropic introduced an "effort parameter" that allows users to adjust how much computational work the model applies to each task -- balancing performance against latency and cost. Enterprise customers provided early validation of the efficiency claims. 
"Opus 4.5 beats Sonnet 4.5 and competition on our internal benchmarks, using fewer tokens to solve the same problems," said Michele Catasta, president of Replit, a cloud-based coding platform, in a statement sent to VentureBeat. "At scale, that efficiency compounds." GitHub's chief product officer, Mario Rodriguez, said early testing shows Opus 4.5 "surpasses internal coding benchmarks while cutting token usage in half, and is especially well-suited for tasks like code migration and code refactoring."
Early customers report AI agents that learn from experience and refine their own skills
One of the most striking capabilities demonstrated by early customers involves what Anthropic calls "self-improving agents" -- AI systems that can refine their own performance through iterative learning. Rakuten, the Japanese e-commerce and internet company, tested Claude Opus 4.5 on automation of office tasks. "Our agents were able to autonomously refine their own capabilities -- achieving peak performance in 4 iterations while other models couldn't match that quality after 10," said Yusuke Kaji, Rakuten's general manager of AI for business. Albert explained that the model isn't updating its own weights -- the fundamental parameters that define an AI system's behavior -- but rather iteratively improving the tools and approaches it uses to solve problems. "It was iteratively refining a skill for a task and seeing that it's trying to optimize the skill to get better performance so it could accomplish this task," he said. The capability extends beyond coding. Albert said Anthropic has observed significant improvements in creating professional documents, spreadsheets, and presentations. "They're saying that this has been the biggest jump they've seen between model generations," Albert said. "So going even from Sonnet 4.5 to Opus 4.5, bigger jump than any two models back to back in the past." 
Fundamental Research Labs, a financial modeling firm, reported that "accuracy on our internal evals improved 20%, efficiency rose 15%, and complex tasks that once seemed out of reach became achievable," according to co-founder Nico Christie.
New features target Excel users, Chrome workflows and eliminate chat length limits
Alongside the model release, Anthropic rolled out a suite of product updates aimed at enterprise users. Claude for Excel became generally available for Max, Team, and Enterprise users with new support for pivot tables, charts, and file uploads. The Chrome browser extension is now available to all Max users. Perhaps most significantly, Anthropic introduced "infinite chats" -- a feature that eliminates context window limitations by automatically summarizing earlier parts of conversations as they grow longer. "Within Claude AI, within the product itself, you effectively get this kind of infinite context window due to the compaction, plus some memory things that we're doing," Albert explained. For developers, Anthropic released "programmatic tool calling," which allows Claude to write and execute code that invokes functions directly. Claude Code gained an updated "Plan Mode" and became available on desktop in research preview, enabling developers to run multiple AI agent sessions in parallel.
Market heats up as OpenAI, Google race to match performance and pricing
Anthropic reached $2 billion in annualized revenue during the first quarter of 2025, more than doubling from $1 billion in the prior period. The number of customers spending more than $100,000 annually jumped eightfold year-over-year. The rapid release of Opus 4.5 -- just weeks after Haiku 4.5 in October and Sonnet 4.5 in September -- reflects broader industry dynamics. OpenAI released multiple GPT-5 variants throughout 2025, including a specialized Codex Max model in November that can work autonomously for up to 24 hours. Google shipped Gemini 3 in mid-November after months of development. 
Albert attributed Anthropic's accelerated pace partly to using Claude to speed its own development. "We're seeing a lot of assistance and speed-up by Claude itself, whether it's on the actual product building side or on the model research side," he said. The pricing reduction for Opus 4.5 could pressure margins while potentially expanding the addressable market. "I'm expecting to see a lot of startups start to incorporate this into their products much more and feature it prominently," Albert said. Yet profitability remains elusive for leading AI labs as they invest heavily in computing infrastructure and research talent. The AI market is projected to top $1 trillion in revenue within a decade, but no single provider has established dominant market position -- even as models reach a threshold where they can meaningfully automate complex knowledge work. Michael Truell, CEO of Cursor, an AI-powered code editor, called Opus 4.5 "a notable improvement over the prior Claude models inside Cursor, with improved pricing and intelligence on difficult coding tasks." Scott Wu, CEO of Cognition, an AI coding startup, said the model delivers "stronger results on our hardest evaluations and consistent performance through 30-minute autonomous coding sessions." For enterprises and developers, the competition translates to rapidly improving capabilities at falling prices. But as AI performance on technical tasks approaches -- and sometimes exceeds -- human expert levels, the technology's impact on professional work becomes less theoretical. When asked about the engineering exam results and what they signal about AI's trajectory, Albert was direct: "I think it's a really important signal to pay attention to."
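The "parallel test-time compute" technique mentioned in this article is described only as aggregating multiple attempts and selecting the best result. A common way to picture that is best-of-N selection, sketched below under stated assumptions: the `generate` and `score` callables are hypothetical placeholders, the attempts run sequentially rather than in parallel for simplicity, and nothing here reflects how Anthropic actually ran or scored the exam.

```python
from typing import Callable, TypeVar

T = TypeVar("T")


def best_of_n(generate: Callable[[], T], score: Callable[[T], float], n: int) -> T:
    """Draw `n` candidate answers and return the one with the highest score."""
    if n < 1:
        raise ValueError("n must be at least 1")
    candidates = [generate() for _ in range(n)]
    return max(candidates, key=score)


# Toy stand-ins: each "attempt" is a number, and the scorer prefers values near 42.
attempts = iter([10, 40, 95])


def toy_generate() -> int:
    return next(attempts)


def toy_score(answer: int) -> float:
    return -abs(answer - 42)


best = best_of_n(toy_generate, toy_score, n=3)
print(best)
```

The quality of the selected answer depends entirely on the scoring function, which is why results obtained this way are usually reported separately from single-attempt scores.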
[13]
Anthropic Completes AI Model Upgrades With Claude Opus 4.5 -- And Slashes Prices
Anthropic released Claude Opus 4.5 on Monday, completing its three-model family and marking the company's third major launch in just two months. The new flagship model claims the top spot in coding benchmarks while cutting prices dramatically. The release caps a rapid-fire rollout that began with Claude Sonnet 4.5 in late September and continued with Claude Haiku 4.5 in October. Now with Opus joining its siblings, Anthropic offers developers a complete toolkit: Opus for complex production work, Sonnet for everyday tasks, and Haiku for speed and efficiency-related tasks that require simple logic. Claude Opus 4.5 scored 80.9% on SWE-bench Verified, a benchmark testing real-world software engineering tasks. That edges out OpenAI's GPT-5.1-Codex-Max at 77.9% and Google's Gemini 3 Pro at 76.2%. Anthropic says Opus outperformed every human candidate on its internal performance engineering exam -- a two-hour assessment designed to evaluate judgment under pressure. There has been a race between AI giants to end the year in the top of the leaderboards. Google launched Gemini 3 Pro on November 18, positioning it as a breakthrough in multimodal reasoning. OpenAI countered the next day with GPT-5.1-Codex-Max. Anthropic's response with Opus came just a few days later, but it arrived with a hook: pricing at $5 per million input tokens and $25 per million output tokens, which represents a 67% cut from the previous Opus model. Alibaba's Qwen models add another dimension to the race. The company released Qwen2.5-Max in late January with over 20 trillion training tokens, claiming it outperforms DeepSeek-V3 on key benchmarks. Qwen3-Max, launched in September with more than 1 trillion parameters, ranks third globally on LMArena and excels at different tasks like deep research, multimodal reasoning, or workflows in eastern languages. While Qwen models remain relatively obscure in Western markets, they represent China's push for AI self-reliance amid U.S. 
chip export restrictions. That pricing sits between OpenAI's newest GPT-5.1 ($1.25/$10) and Anthropic's older Opus 4.1 ($15/$75), though it's still pricier than Gemini 3 Pro's $2/$12. The reduction signals market pressure as leading AI labs compete not just on capability, but on making frontier intelligence economically viable for scaled deployment. Claude's latest offering is still pricier than many Asian competitors, but is also a bit more capable. So users can now choose between cost-efficiency and pure technical capability. Sonnet 4.5, released September 30, brought state-of-the-art coding and agent capabilities at moderate cost and was already better than Opus 4.1 at specific tasks. The simpler Haiku 4.5 was unveiled October 15. Opus 4.5 now sits at the top, handling the hardest reasoning and longest-running tasks. Similar to Sonnet and GPT-5, Claude Opus 4.5 uses what Anthropic calls a "hybrid reasoning" architecture -- a single model trained for both direct inference and chain-of-thought processing. It supports a 200,000 token context window and can output up to 64,000 tokens. The model's knowledge cutoff is March 2025, slightly ahead of Sonnet's January date. Developer Simon Willison tested Opus 4.5 extensively over the weekend, using it to refactor one of his projects. The model handled 20 commits across 39 files, adding 2,022 lines and removing 1,173 others. "It's clearly an excellent new model," Willison wrote, though he noted that reverting to Sonnet 4.5 afterward didn't dramatically reduce his productivity. "I'm not saying the new model isn't an improvement on Sonnet 4.5 -- but I can't say with confidence that the challenges I posed [to] it were able to identify a meaningful difference in capabilities between the two," he wrote. Theo Browne, a developer, YouTuber, and CEO of AI platform T3 Chat, called Claude Opus 4.5 "insane," adding in a video review that it's "definitely the best coding model ever made." 
The competitive landscape has become increasingly crowded. Google's Gemini 3 Pro dominated headlines last week, scoring 1501 on LMArena and earning praise from Salesforce CEO Marc Benioff, who said he's ditching ChatGPT for Google's model. That announcement sent Alphabet's stock up more than 6% and reportedly rattled OpenAI CEO Sam Altman, who told colleagues Gemini would create "temporary economic headwinds." Microsoft and Nvidia announced multi-billion-dollar investments in Anthropic last week, boosting the startup's valuation to approximately $350 billion. The deals include expanded Azure integration and Nvidia-powered infrastructure for training and deploying Claude models. Opus 4.5 is available immediately via Anthropic's API, AWS Bedrock, Google Vertex AI, and the Claude web and desktop apps.
[14]
Anthropic targets coding dominance with the new Claude Opus 4.5
'Opus 4.5 is a step forward in what AI systems can do,' claims Anthropic. Anthropic continues to target the enterprise coding market with its latest launch, Claude Opus 4.5. The new model seemingly surpasses the AI company's previous release Claude Sonnet 4.5 as the "best" model available for coding. Released yesterday (24 November), Opus 4.5 also showcases superior abilities in relation to agents and computer use, according to the model's performance evaluation by Anthropic. Boasting around 80pc accuracy on software engineering benchmarks, Opus 4.5 surpasses Sonnet 4.5's roughly 77pc accuracy, as well as OpenAI's GPT-5.1 Codex Max, which sits at nearly 78pc accuracy, and Google Gemini 3 Pro, which is at just more than 76pc. "Opus 4.5 is a step forward in what AI systems can do," Anthropic said, adding that it's also "meaningfully" better at everyday tasks such as deep research and working with slides and spreadsheets. In addition, the new model shows the lowest levels of "concerning behaviour" according to Anthropic's testing. By comparison, GPT-5.1 and Gemini 3 Pro show the highest levels, while Anthropic's other models, Sonnet 4.5 and the recently released Haiku 4.5, rank lower than those rivals, though not as low as Opus 4.5. Moreover, Opus 4.5 uses "dramatically" fewer tokens than its predecessors to reach "similar or better outcomes", Anthropic said. It lets users decide how long the model should spend on a query. Set to a medium effort level, Opus 4.5 matches Sonnet 4.5's best score on the software engineering bench, but uses 76pc fewer output tokens. At its highest effort level, Opus 4.5 exceeds Sonnet 4.5's performance by 4.3 percentage points while using 48pc fewer tokens, it explains. Comparatively, OpenAI's GPT‑5.1 Auto, marketed towards general consumers using the chatbot, automatically decides where to route a query, so users do not need to choose which version to use for their needs. 
Alongside the new Opus, Anthropic is also releasing updates to the Claude Developer Platform, Claude Code as well as to its consumer apps. The company released Claude for Excel in October. Windsurf CEO Jeff Wang cosigns Anthropic's new model, commenting that Opus 4.5 is now at a price point that lends the model to be used for most tasks. Reports suggest that Anthropic expects to generate as much as $70bn in revenue in 2028, a growth projection fuelled by the company's success in selling AI models for business use cases. The company has, in recent months, announced or expanded its relationships with the likes of Microsoft, Salesforce and Deloitte. Anthropic also announced a joint strategic partnership with Nvidia and Microsoft, promising to purchase $30bn worth of Azure compute capacity to use to scale its Claude models - all powered using Nvidia chips. Meanwhile, the $500bn OpenAI is also pursuing a B2B strategy alongside pushing its chatbots to its growing general consumer-base of around 800m weekly users. The company expects to generate a revenue of around $100bn in 2027, according to the CEO. However, where Anthropic projects a positive cashflow, OpenAI expects significant losses as it mounts up expenses with its massive infrastructure spending.
[15]
With new Opus 4.5 model, Anthropic's Claude could remain the best AI coding tool
Claude Code is already widely used by developers, and with a new brain, it may fend off Google's new Antigravity tool. Anthropic launched its newest model, Claude Opus 4.5, putting the company back atop the benchmark rankings for AI software coding. Opus 4.5 scores over 80% on the widely-used SWE-bench, which tests models for software engineering skill. Google's impressive Gemini 3 Pro, launched last week, briefly held the top score with 76.2%. Anthropic's Claude product lead Scott White tells Fast Company that the model has also scored higher than any human on the engineering take-home assignment the company gives to engineering job candidates.
[16]
Anthropic releases new flagship Claude Opus 4.5 model - SiliconANGLE
Anthropic PBC today launched Claude Opus 4.5, its new flagship large language model. The company says Opus 4.5 is its safest and most capable LLM yet. The model is rolling out a few weeks after the two other entries into the Claude 4.5 series: Sonnet 4.5 and Haiku 4.5. The LLMs are positioned as midrange and entry-level alternatives to Opus 4.5, respectively. According to Anthropic, Opus 4.5 is better than the competition at powering artificial intelligence agents that use tools to automate work. When agents based on the model encounter a task they can't complete on their first try, they can iteratively refine their capabilities. Anthropic says that Opus 4.5 reaches "peak performance" after four iterations, while other LLMs require 10 attempts. Opus 4.5 also brings other improvements. Compared to Anthropic's other models, it provides better support for long-running agents. That should make the LLM more useful for tasks such as rewriting applications that can take several hours. Developers often automate complex, long-running tasks using not one but several agents that coordinate their work. According to Anthropic, software teams pursuing such projects can use Opus 4.5 to power the lead agent and the entry-level Haiku 4.5 to power sub-agents. Assigning simple processing steps to a lightweight LLM lowers inference costs. Programming is another area where Opus 4.5 provides better performance than its predecessors. According to Anthropic, the model requires less human guidance and handles ambiguity better. A developer could, for example, ask Opus 4.5 to troubleshoot a bug without specifying that fixing it requires the model to review multiple systems. The model's new reasoning features are complemented by integrations with Excel and Google Chrome. Anthropic first introduced the Excel integration last month as a research preview. The add-on rolled out to Claude for Financial Services, a feature bundle geared towards financial professionals. 
It makes Claude accessible through a sidebar in the Excel interface. The integration is now generally available for users with Max, Team, and Enterprise subscriptions. Anthropic has added support for pivot tables, tables that are used to summarize information spread across a large number of spreadsheet fields. Additionally, users can now upload files and generate charts. Claude's Chrome integration debuted two months before the Excel add-on. It's a browser extension that enables the chatbot to perform actions in web applications on the user's behalf. At the time of its introduction, Anthropic stated that the feature includes mitigations against malicious prompts embedded in web content. The Chrome extension initially rolled out to 1,000 users of Claude's top-end Max plan. Today, Anthropic made the feature generally available to all Max subscribers. The company also released a number of other enhancements as part of today's update. The Claude Code programming assistant is now available in Anthropic's desktop client, while the Max and Teams plans have received higher usage caps. Claude Chat, in turn, is gaining the ability to summarize information from earlier parts of a chat session. Opus 4.5 is available through Claude Chat, Claude Code and application programming interfaces. Developers who use the APIs have access to a new "effort" setting that makes it possible to adjust the amount of time and computing capacity the LLM uses to perform a task. The more infrastructure is allocated to a task, the higher the output quality. Opus 4.5 is priced at $5 per million input tokens and $25 per million output tokens.
[17]
Anthropic's Newest AI Model Is Not Just More Powerful -- It's Cheaper
Yesterday, Anthropic launched Claude Opus 4.5, its largest and most powerful AI model yet. Not only does the new model top several benchmarks in coding and knowledge, it's also way cheaper than its predecessor. Just a week after competitors OpenAI and Google dropped their state-of-the-art AI models (called GPT-5.1-Codex-Max and Gemini 3 Pro, respectively) Anthropic has reset the conversation with Opus 4.5. The model sets new records for math, agentic work, and most importantly, AI-powered coding. Arguably more impressive than the coding performance is Opus 4.5's price. Claude Opus 4.1, a model released in August, was notoriously expensive for developers building applications with Claude's API, costing $15 for every million input tokens and $75 for every million output tokens. Opus 4.5 is much cheaper at $5 for every million input tokens and $25 for every million output tokens. Anthropic says that Opus 4.5 is also "very effective at managing a team of subagents," meaning it can orchestrate multiple entities to simultaneously complete work. This can be incredibly useful when working in multiple codebases at the same time. In addition to the new model, Anthropic is also bringing Claude Code, its agentic system for using Claude as a virtual software engineer, to its desktop app. By simply clicking a toggle, users will be able to go from chatting with Claude to coding with Claude. Claude Code is also getting a new "Plan Mode," in which the platform will ask clarifying questions and build detailed plans before undertaking a coding task. Another major development from Anthropic is that users will no longer need to worry about their conversations hitting a length limit, as they previously would. Now, when a conversation gets long enough, Claude will automatically summarise the chat's history so users can keep the conversation going. On the coding front, Anthropic's testing reveals that Claude has some serious skills. 
Claude Opus 4.5 scored 80.9 percent on SWE-bench verified, a widely used benchmark for judging an AI model's coding abilities. Gemini 3 Pro, the new model revealed by Google last week, achieved a 76.2 percent accuracy score on the same test, and GPT-5.1-Codex-Max, the new model from OpenAI, achieved a 77.9 percent accuracy score. Finally, Opus 4.5 may also be Anthropic's most enterprising model yet. In Vending-Bench-2, a benchmark designed to simulate an AI model's ability to run a vending machine business, Opus 4.5 ended a simulated year of operation with a balance of $4,967.06, well above Claude Sonnet 4.5's $3,838.74. The model was the second-best artificial vending machine proprietor tested by startup Andon Labs, coming only behind Gemini 3 Pro, which ended the simulated year with $5,478.16.
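As a rough check on the per-token prices quoted in this article, the comparison can be worked through in a few lines of Python. This is a minimal sketch: the prices are the ones reported above ($15/$75 per million tokens for Opus 4.1, $5/$25 for Opus 4.5), while the function names and the example workload of 2 million input and 500,000 output tokens are hypothetical, chosen only for illustration.

```python
# Prices in USD per million tokens, as quoted in the article.
OPUS_4_1 = {"input": 15.00, "output": 75.00}
OPUS_4_5 = {"input": 5.00, "output": 25.00}


def request_cost(prices, input_tokens, output_tokens):
    """Return the USD cost of a workload at the given per-million-token prices."""
    total = input_tokens * prices["input"] + output_tokens * prices["output"]
    return total / 1_000_000


def percent_reduction(old, new):
    """Return how much cheaper `new` is than `old`, as a percentage of `old`."""
    return (old - new) / old * 100


# Hypothetical workload (an assumption, not a figure from the article).
INPUT_TOKENS = 2_000_000
OUTPUT_TOKENS = 500_000

old_cost = request_cost(OPUS_4_1, INPUT_TOKENS, OUTPUT_TOKENS)
new_cost = request_cost(OPUS_4_5, INPUT_TOKENS, OUTPUT_TOKENS)

print(f"Opus 4.1: ${old_cost:.2f}")
print(f"Opus 4.5: ${new_cost:.2f}")
print(f"Reduction: {percent_reduction(old_cost, new_cost):.1f}%")
```

Because both the input and the output price fall to one third of their previous level, the saving works out to roughly two thirds regardless of the mix of input and output tokens in the workload.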
[18]
Claude Opus 4.5 Arrives With Upgraded Coding and Agentic Performance
Anthropic released Claude Opus 4.5, the company's frontier artificial intelligence (AI) model, on Monday. The final member of the Claude 4.5 family and the most performant model in the series comes with major improvements in code generation, long-context reasoning, and agentic capabilities. At the same time, the AI firm highlighted that the model uses fewer tokens to complete complex tasks. Opus 4.5 is available via the Claude API and on supported cloud platforms, replacing Opus 4.1 as Anthropic's top-tier commercial model. What's New With Claude Opus 4.5 In a newsroom post, Anthropic announced and detailed the new large language model. The company says the Claude Opus 4.5 brings marked improvements in three areas of coding, agentic tool use, and long-context reasoning. On coding, the company reports that Opus 4.5 solves more long-horizon coding tasks than its predecessor while using up to 65 percent fewer tokens. In practice, this means the model can process the same instructions using less computational cost and within smaller prompt windows. The company attributes this improvement to better planning and more efficient internal reasoning steps. Opus 4.5 is trained to manage more complex multi-step workflows. During internal tests, Anthropic claimed the model was able to refactor two separate codebases at once, coordinate the work of three agents, and maintain high-level plans while executing low-level details. These capabilities are enabled by improving long-context reasoning of the model and a new system to improve tool calling (instead of pre-loading a large library, the LLM only calls tools relevant to the task). This is said to reduce context usage by up to 85 percent. For content generation, Opus 4.5 is designed to handle long documents more reliably. The model is capable of producing multi-page narrative chapters (10-15 pages in Anthropic's example) and maintaining characters, plot direction and tone across those longer passages. 
The company also claims better performance in complex 3D reasoning exercises, where the model describes scenes or spatial relationships in greater detail than earlier versions. Coming to benchmarks, the company conducted internal testing and claimed that Claude Opus 4.5 outscored rivals in code-based tests. Notably, in the SWE-Bench Verified benchmark, which measures agentic coding, Opus 4.5 was said to score 80.9 percent, compared with Gemini 3 Pro at 76.2 percent and GPT-5.1 Codex Max at 77.9 percent. Anthropic has also made Claude Opus 4.5 more affordable than its predecessors. The company says the model offers similar or improved performance with roughly one-third of the cost for many enterprise workloads. The LLM is currently available across Claude's app and website to paid subscribers, via Anthropic's application programming interface (API), as well as major cloud platforms, including Google Vertex AI and Amazon Bedrock.
[19]
Anthropic bolsters AI model Claude's coding, agentic abilities with Opus 4.5
Anthropic has launched its upgraded Opus 4.5 model, giving Claude stronger reasoning, coding and financial analysis skills. It can build advanced agents that learn from experience and support complex business tasks. The release intensifies competition with OpenAI, as Anthropic pushes towards increasingly powerful, human-surpassing AI systems. Artificial intelligence startup Anthropic unveiled an upgraded Opus model on Monday, boosting Claude's ability to write detailed code, create sophisticated agents and streamline enterprise workflows through spreadsheet and financial analysis. The new model comes as Amazon and Alphabet-backed Anthropic races against OpenAI and other rivals to develop cutting-edge large language models aimed at achieving capabilities that could surpass human intelligence. Opus 4.5 ranks among the most powerful models in the Claude family, offering deep reasoning and memory, coding and a versatile performance across a range of computer applications, including financial tasks such as modelling and forecasting. Its agents autonomously refine their own capabilities and store insights from past work to apply at a later date, Anthropic said.
[20]
Opus v4.5 Feels Strikingly Human in Big Tests : Beats GPT 5.1 on Coding & Reasoning
What if a machine could think, reason, and even make ethical decisions as well as, or better than, a human? With the release of Claude Opus 4.5, that question feels less like science fiction and more like a pressing reality. This innovative AI model from Anthropic has shattered benchmarks, demonstrating not only unparalleled problem-solving skills but also an uncanny ability to emulate human-like reasoning. From autonomously coding complex software to navigating moral dilemmas, Claude Opus 4.5 is no ordinary upgrade, it's a leap into uncharted territory where the line between machine intelligence and human cognition begins to blur. Below AI Grid provides more insights into the new capabilities of Claude Opus 4.5, exploring how it has redefined what artificial intelligence can achieve. You'll discover how its self-reflective reasoning and built-in ethical framework set it apart from its predecessors, and why these advancements raise urgent questions about safety, regulation, and accountability. Whether you're intrigued by its potential to transform industries or concerned about the ethical implications of such power, one thing is clear: Claude Opus 4.5 isn't just a tool, it's a paradigm shift. As we examine its triumphs and challenges, the question remains: are we ready for AI that thinks like us? Claude Opus 4.5 Overview Breaking Records: Benchmark Performance Claude Opus 4.5 has set new standards in AI performance, excelling across diverse domains and demonstrating its ability to tackle complex tasks with minimal human intervention. Its achievements include: * Autonomous Coding: The model achieved an impressive 80.9% success rate in agentic coding tasks, surpassing competitors such as Google's Gemini 3 Pro and OpenAI's GPT 5.1. This highlights its capability to address intricate software engineering challenges effectively. 
* Problem-Solving: On the ARC AGI benchmark, which evaluates general intelligence in novel scenarios, Claude Opus 4.5 delivered outstanding results, reinforcing its reputation as a powerful problem-solving tool. * Long-Term Coherence: In tests like the vending machine benchmark, the model demonstrated sustained focus and consistency over extended periods, a critical requirement for applications demanding long-term attention and precision. These accomplishments underline the potential of Claude Opus 4.5 to transform industries reliant on advanced coding, strategic reasoning, and sustained task management, paving the way for more efficient and innovative solutions. Human-Like Reasoning: A Paradigm Shift Claude Opus 4.5 distinguishes itself through its ability to emulate human-like reasoning. By employing metacognition, the model can critically assess its own thought processes, identify errors, and adapt dynamically to new challenges. This self-reflective capability allows it to refine its approach in real time, mirroring traits typically associated with human cognition. Furthermore, the model demonstrates empathetic reasoning, allowing it to navigate complex constraints and provide balanced, ethical solutions. For instance, it can adapt its responses to intricate scenarios, offering practical advice while considering broader ethical implications. These advanced reasoning capabilities make Claude Opus 4.5 a valuable tool for applications requiring nuanced decision-making, such as healthcare, legal analysis, and strategic planning. Ethical Behavior and Moral Judgment One of the most notable features of Claude Opus 4.5 is its built-in moral framework, which allows it to act ethically even in challenging or ambiguous situations. 
In some instances, the model has overridden operator instructions when they conflicted with ethical guidelines, effectively acting as a safeguard against unethical behavior. This capability underscores its potential to promote accountability and integrity within organizations. Additionally, Claude Opus 4.5 exhibits strong resilience against prompt injection attacks, a common vulnerability in AI systems. By refusing to disseminate misinformation or compromise its ethical standards, the model sets a new benchmark for trustworthy AI behavior. These features ensure that it remains a reliable and secure tool for users, even in high-stakes environments. Addressing Safety and Regulation As AI systems like Claude Opus 4.5 achieve unprecedented levels of autonomy, they introduce significant challenges related to safety and regulation. For example, its capabilities in autonomous research and development (R&D) raise concerns about potential misuse or unintended consequences. Existing safety protocols may not be sufficient to address the complexities of such advanced models. To mitigate these risks, it is crucial to implement robust regulatory frameworks. Key measures could include: * Developing identity verification systems to prevent unauthorized use and ensure accountability. * Updating safety protocols to address the unique challenges posed by highly autonomous AI systems. * Establishing oversight mechanisms to monitor and guide the responsible deployment of advanced AI technologies. Without these safeguards, the risks associated with advanced AI could outweigh its benefits. Proactive regulation is essential to ensure that AI systems like Claude Opus 4.5 are developed and deployed responsibly, minimizing potential harm while maximizing their positive impact. Future Implications: Balancing Innovation and Responsibility The advancements demonstrated by Claude Opus 4.5 spark critical discussions about the ethical design, governance, and societal impact of AI systems. 
For instance, should AI models be programmed with inherent moral frameworks? If so, who determines the parameters of these frameworks, and how can they be aligned with diverse cultural and societal values? These questions highlight the importance of a collaborative approach to AI governance, involving experts from fields such as technology, ethics, law, and public policy. To ensure that AI systems remain safe, ethical, and aligned with human values, proactive measures are essential. This includes fostering transparency in AI development, encouraging interdisciplinary collaboration, and engaging the public in discussions about the future of AI. As models like Claude Opus 4.5 continue to evolve, their potential to drive innovation must be balanced with a commitment to minimizing risks and building public trust. Claude Opus 4.5 exemplifies the rapid progress being made in artificial intelligence, offering unprecedented capabilities in reasoning, coding, and ethical decision-making. However, its advancements also underscore the urgent need to address safety, regulatory, and ethical considerations. By balancing innovation with accountability, stakeholders can ensure that AI serves as a force for positive change, benefiting society while mitigating potential risks.
[21]
Anthropic Tipped to Launch the Claude Opus 4.5 AI Model This Week
Anthropic could release the frontier model of its Claude 4.5 family soon. As per the tipster, the Claude Opus 4.5 artificial intelligence (AI) model was spotted on another platform, while the model's release table was also spotted separately. Based on the leaks, it is said that the large language model could arrive on Monday, bringing improvement across various parameters. Not a lot is known about the AI model at present, but based on the improvements in Claude 4.5 Sonnet and Claude 4.5 Haiku, coding performance and agentic capabilities could be a major focus area. Anthropic Could Release Claude Opus 4.5 Soon Tipster @kimmonismus on X (formerly known as Twitter) claimed in a post that Anthropic's Claude Opus 4.5 was being readied for release and that it was spotted on Poe, an AI chatbot platform that hosts third-party models. Later, in a separate post, the tipster shared a release table of the LLM, claiming it is scheduled to be released on Monday, November 24. Social media on Sunday was abuzz with rumours about the release of Anthropic's frontier model. Founder of Zenjoy, Peter Dedene, also shared a screenshot of the release table that mentions that "Claude Kayak" is releasing on Monday. It is said that Kayak is an internal nickname for Opus 4.5. Apart from leaks about its release date, no other reliable information about the AI model is currently known. It has been speculated that the AI model will further improve code generation and bring greater agentic capabilities to users; however, upgrades in other areas are not known. Notably, Anthropic released Claude 4.5 Sonnet in September and Claude 4.5 Haiku, the fastest model in the family, arrived in October. These models displayed improvements in coding, agentic operations, computer use, reasoning, and domain-specific knowledge. Sonnet also managed a score of 77.2 percent on the SWE bench-Verified benchmark, which tests AI models on their coding capabilities. 
This score was higher than what OpenAI's GPT-5 and Google's Gemini 2.5 achieved. Meanwhile, recently, Claude was used to conduct a large-scale agentic cyberattack. In a detailed analysis, Anthropic claimed it to be first-of-its-kind incident where the AI carried out most of the hacking, with minimal input from a human operator.
[22]
Anthropic Targets Coding and Agents Markets With Latest AI Model | PYMNTS.com
The new model is also "meaningfully better" than its predecessors in performing deep research and working with slides and spreadsheets, according to the release. "Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done," Anthropic said in the release. The model is now available on the company's app and application programming interface (API), and on major cloud platforms, per the release. Anthropic also released updates to the Claude Developer Platform, Claude Code and the company's consumer apps, according to the release. Among the changes: the Claude Developer Platform now includes an effort parameter that lets developers decide how much time the model spends thinking about a problem; Claude Code now includes a Plan Mode for building more precise plans; and the consumer apps automatically summarize earlier context as needed to allow longer conversations, per the release. "Each of these updates takes advantage of Claude Opus 4.5's market-leading performance in using computers, spreadsheets and handling long-running tasks," Anthropic said in the release, referring to the consumer apps' updates. This announcement came about a week after Nvidia and Microsoft pledged up to $15 billion to Anthropic as part of a partnership between the three companies. The collaboration includes Anthropic scaling its Claude AI model on the Nvidia-powered Microsoft Azure; Anthropic agreeing to purchase $30 billion of Azure compute capacity and to contract additional compute capacity up to one gigawatt; and Microsoft and Nvidia pledging to invest up to $5 billion and $10 billion, respectively, in Anthropic.
[23]
Anthropic shows off newest AI model, Claude Opus 4.5 (ANTHRO:Private)
Anthropic (ANTHRO) unveiled the release of its latest version of its flagship large language model on Monday, known as Claude Opus 4.5. Claude Opus 4.5 is "the best model in the world for coding, agents, and computer use," Anthropic said. Claude Opus 4.5 outperformed Google Gemini 3 Pro and OpenAI GPT-5.1 in several benchmarks, including achieving 80.9% accuracy on SWE-Bench, becoming the first to reach that figure. Anthropic is launching the Claude for Chrome browser extension, expanding beta access for Claude for Excel, and making the Claude Code app available for desktop users. Partnerships and investments from Microsoft and Nvidia have increased Anthropic's valuation to $350B, up from $183B in September.
[24]
Anthropic Claude Opus 4.5 Tops Coding Benchmarks While Slashing Token Use
What if the future of coding wasn't human, but instead powered by an AI so advanced it could outpace even the most skilled developers? Enter Claude Opus 4.5, a model that doesn't just assist with coding, it redefines what's possible. Imagine an AI capable of building a playable Minecraft clone in a single prompt or solving intricate software challenges with unparalleled precision. Bold claim? Perhaps. But with benchmark-topping performance and a track record of delivering results faster and more accurately than its competitors, Claude Opus 4.5 is making waves as the most fantastic coding model to date. In this perspective, Better Stack explains what makes Claude Opus 4.5 stand out in a crowded field of AI tools. From its innovative token efficiency that slashes costs without sacrificing quality to its adaptability in handling both routine and complex tasks, this model offers something for everyone, from indie developers to enterprise teams. But it's not just about numbers or benchmarks; it's about how this AI is reshaping workflows, empowering creativity, and solving problems in ways that were once unimaginable. Could this be the moment where AI coding models truly surpass human limitations? Let's unpack the evidence and find out. Claude Opus 4.5 has emerged as a leader in coding benchmarks, consistently outperforming competitors such as GPT-5.1-Codex-Max. Its capabilities are particularly evident in single-prompt coding tasks, where it has successfully created complex projects like a playable Minecraft clone and a fully functional Lego builder website. These examples highlight its ability to tackle highly intricate software engineering problems with both precision and speed. Anthropic's internal testing further underscores the model's reliability. Claude Opus 4.5 consistently scores higher than human candidates in coding evaluations, demonstrating its capacity to deliver accurate and dependable results in scenarios where precision is critical. 
For developers working on advanced or time-sensitive projects, this model is an indispensable tool that enhances productivity and reduces error rates. One of the standout features of Claude Opus 4.5 is its token efficiency, which significantly reduces computational overhead. Compared with earlier models such as Sonnet 4.5, it achieves a 76% reduction in token usage at medium effort levels. This optimization not only enhances processing speed but also translates into substantial cost savings for users, making it a practical choice for both individual developers and large-scale organizations. The introduction of the "effort parameter" adds a new dimension of flexibility. This feature allows users to adjust the model's performance based on the complexity of their tasks. By tuning the effort level, you can balance cost against output quality. Whether working on straightforward projects or tackling highly intricate coding challenges, this adaptability ensures that the model meets your specific requirements. Claude Opus 4.5 is designed to be both cost-effective and high-performing. It costs one-third as much as the previous Opus pricing, with input tokens priced at $5 per million and output tokens at $25 per million. This affordability makes it accessible to a wide range of users, from independent developers to large organizations with frequent AI demands. Despite its lower cost, the model maintains exceptional quality across a variety of coding tasks. This balance between affordability and performance ensures that users can achieve their goals without exceeding their budgets. Whether you're developing complex software or addressing routine coding needs, Claude Opus 4.5 offers a practical and reliable solution. 
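To make the pricing and token-efficiency figures above concrete, here is a minimal Python sketch of the arithmetic. The $5 and $25 per-million-token prices and the 76% reduction come from the article; the workload sizes (20,000 input tokens, 10,000 baseline output tokens) and the function names are hypothetical and chosen only for illustration. It also assumes, as a simplification, that the reduction applies to output tokens only.

```python
# Prices quoted in the article, in US dollars per million tokens.
INPUT_PRICE_PER_MTOK = 5.00
OUTPUT_PRICE_PER_MTOK = 25.00


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the cost in dollars of one request at the quoted prices."""
    total = (
        input_tokens * INPUT_PRICE_PER_MTOK
        + output_tokens * OUTPUT_PRICE_PER_MTOK
    )
    return total / 1_000_000


def reduced_output_tokens(baseline_tokens: int, reduction: float) -> int:
    """Apply a fractional reduction (0.76 means 76% fewer tokens)."""
    return round(baseline_tokens * (1.0 - reduction))


# Hypothetical workload: 20,000 input tokens, 10,000 baseline output tokens.
baseline_cost = request_cost(20_000, 10_000)
efficient_cost = request_cost(20_000, reduced_output_tokens(10_000, 0.76))

print(f"baseline: ${baseline_cost:.2f}")
print(f"with 76% fewer output tokens: ${efficient_cost:.2f}")
```

Because output tokens cost five times as much as input tokens at these prices, a cut in output volume moves the total far more than the same cut in input volume would.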
In benchmark evaluations, Claude Opus 4.5 consistently ranks as a top performer in coding-related categories, solidifying its reputation as a leading AI model. Its ability to handle diverse coding tasks with precision and efficiency makes it a preferred choice for developers seeking a dependable tool. However, its performance in non-coding benchmarks, such as graduate reasoning, visual reasoning, multilingual Q&A, and the vending machine benchmark, is slightly less dominant. These results highlight areas where further refinement could enhance its capabilities. Despite these minor limitations, the model's multifunctionality remains a key strength, allowing it to perform reliably across a wide range of use cases. For users seeking a single AI solution that excels in both coding and non-coding tasks, Claude Opus 4.5 offers a compelling combination of versatility and performance. Claude Opus 4.5 is more than just an AI coding model; it is a powerful tool that enables developers to achieve their goals efficiently and affordably. Its ability to handle complex coding tasks with fewer tokens and at a lower cost makes it an ideal choice for daily use. Whether you're working on sophisticated software development, experimenting with creative projects, or addressing routine coding challenges, this model provides a dependable and cost-effective solution. Its competitive pricing structure ensures that innovation remains accessible, allowing you to focus on delivering high-quality results without budget constraints. By combining performance, efficiency, and affordability, Claude Opus 4.5 sets a new standard for AI coding models, making it an invaluable resource for developers and organizations alike.
[25]
Claude Opus 4.5 Pricing & Performance : Anthropic's Most Efficient Model
What if the future of artificial intelligence wasn't just about being smarter but also about being leaner, faster, and more accessible? Enter Claude Opus 4.5, Anthropic's latest contribution to the AI landscape, which promises to redefine what's possible in efficiency and cost-effectiveness. Imagine an AI model that uses 65% fewer tokens while delivering exceptional performance, saving businesses both time and money. In a world where computational resources are precious and budgets are tight, this isn't just an incremental improvement; it's a substantial one. With its strategic partnership with Google and access to over 1 million Tensor Processing Units (TPUs), Claude Opus 4.5 is poised to challenge the status quo and reshape how we think about AI scalability. In this overview, Caleb Writes Code explores how Claude Opus 4.5 addresses some of the most pressing challenges in the AI industry, from token inefficiency to the rising costs of intelligence. You'll discover how its tools, like the Tool Search Tool and Code Execution Tool, are tailored for real-world applications, making it a practical choice for developers and businesses alike. But this isn't just about technology; it's about the broader implications for an increasingly competitive market and what it means for organizations striving to stay ahead. As we unpack the features and strategies behind this model, you may find yourself rethinking what you expect from AI. After all, efficiency isn't just a technical achievement; it's a necessity in the race to innovate.

Claude Opus 4.5 Overview

Efficiency and Performance: Transforming Token Usage

Claude Opus 4.5 addresses one of the most persistent challenges in AI: token inefficiency. By using 65% fewer tokens while achieving a SWE-bench Verified score of 80.9%, the model delivers faster processing speeds and reduced computational demands. For you, this translates into tangible cost savings and improved operational efficiency. 
The model's efficiency is further enhanced by tools like the "tool search tool" and "programmatic tool calling", which dynamically retrieve relevant information and optimize the use of the context window. These features are particularly valuable for agentic applications, where maintaining accurate and relevant context is essential for high performance. Whether you are managing complex workflows or developing AI-driven solutions, these advancements ensure that resources are used effectively.

Cost of Intelligence: Affordable AI for All

Claude Opus 4.5 exemplifies the industry's shift toward affordable AI solutions. With pricing set at $5 per million tokens for input and $25 per million tokens for output, the model strikes a balance between cost and performance. While the output token cost may appear higher, the model's ability to reduce overall token usage offsets this expense, making it a cost-effective choice for businesses. For organizations like yours, this pricing structure means lower operational costs without compromising on functionality or intelligence. Whether you are scaling AI deployments or exploring new applications, Claude Opus 4.5 makes advanced AI technology more accessible, allowing you to achieve more with fewer resources.

Google TPU Partnership: Expanding Computational Horizons

Anthropic's collaboration with Google provides access to over 1 million Tensor Processing Units (TPUs), significantly enhancing the computational power of Claude Opus 4.5. This partnership not only boosts the model's performance but also diversifies the AI hardware market by reducing reliance on Nvidia's infrastructure. For you, this development ensures a more competitive and resilient ecosystem, fostering innovation and scalability. 
The partnership underscores the importance of collaboration in driving technological progress, offering businesses and developers access to resources that were previously out of reach.

Agentic Applications: Practical Tools for Real-World Challenges

Claude Opus 4.5 is designed to excel in agentic use cases, where efficient context management and precision are critical. Two standout features demonstrate its practical utility:

* Tool Search Tool: Dynamically retrieves documentation, conserving valuable context window space and ensuring that relevant information is readily available for decision-making.
* Code Execution Tool: Serves as an intermediary layer to streamline API communication, enhancing precision and reducing complexity in technical workflows.

If you work in fields like software development, API integration, or other technical domains, these tools can significantly enhance productivity and accuracy. By focusing on real-world applications, Claude Opus 4.5 showcases its versatility and ability to address the challenges faced by modern developers and businesses.

Competitive AI Landscape: Innovation in a Crowded Market

The AI market is increasingly competitive, with major players like OpenAI, Google, Meta, and emerging Chinese models driving rapid advancements. This competition is essential for fostering innovation and ensuring that the benefits of AI reach a broader audience, including businesses like yours. Anthropic's Claude Opus 4.5 distinguishes itself by prioritizing efficiency and cost-effectiveness. Its focus on practical applications and affordability makes it a compelling choice in a crowded field, particularly for organizations seeking reliable solutions that align with their operational goals.

Market Implications: The Shift Toward Cost-Effective AI

Claude Opus 4.5 reflects a broader industry trend toward reducing costs while enhancing functionality. 
Anthropic's strategic emphasis on API revenue and agentic applications highlights its commitment to delivering value to businesses and developers. For you, this represents an opportunity to use state-of-the-art AI technology at a fraction of the cost traditionally associated with such capabilities. Staying informed about these advancements will be crucial for maintaining a competitive edge in an ever-evolving market, where efficiency and affordability are becoming key differentiators.

Shaping the Future of AI with Claude Opus 4.5

Anthropic's Claude Opus 4.5 is not merely an incremental improvement; it is a significant step forward in the AI industry. By prioritizing efficiency, reducing costs, and focusing on practical applications, it sets a new benchmark for what AI can achieve. Whether you are a developer, a business leader, or an AI enthusiast, understanding the capabilities of Claude Opus 4.5 will help you make informed decisions and capitalize on the opportunities this technology presents. As the AI landscape continues to evolve, models like Opus 4.5 will play a pivotal role in shaping the future of intelligent systems.

Media Credit: Caleb Writes Code
[26]
Anthropic's Claude Opus 4.5 Launched, Surpasses Gemini 3 Pro in Coding Benchmarks
Anthropic Unveils Claude Opus 4.5, A New Peak Model That Outperforms Gemini 3 Pro in Coding

Anthropic has officially launched Claude Opus 4.5 at a time when demand for superior coding precision and reliable AI processes is increasing rapidly across sectors. The company claims this release is its best model yet, offering more precise support for developers, automation, and general professional tasks. The launch matters all the more as businesses seek next-generation AI models that are not only faster but also more reliable than current systems. According to Anthropic, Opus 4.5 sets a new standard by excelling across real-world software engineering benchmarks. Its promise of surpassing leading rivals creates a compelling reason to watch how this model reshapes real-world development work.
[27]
Anthropic launches Claude Opus 4.5; model 'just gets it' By Investing.com
Investing.com -- Anthropic unveiled Claude Opus 4.5 on Monday, a major upgrade to its frontier artificial intelligence platform, delivering what the company calls "the best model in the world for coding, agents, and computer use." Available as of today across apps, API, and major cloud platforms, the model promises heightened reasoning skills, better performance in unseen domains, and substantial improvements in real-world software engineering benchmarks. Priced at $5/$25 per million tokens, the new offering aims to make advanced Opus-level capabilities more accessible to individuals, teams, and enterprises. Opus 4.5 leads on the SWE-bench Verified test, outperforming other frontier models in engineering problem solving, and offering new capabilities that Anthropic says will reshape how work gets done. According to Anthropic's internal testing, "Opus 4.5 just 'gets it'." The model shows a marked improvement in agentic capabilities, solving complex tasks with human-like creativity. The model also achieved the highest score ever, surpassing any human candidate, on Anthropic's take-home engineering test under time constraints, highlighting AI's expanding role in skilled technical roles. Beyond coding, Claude Opus 4.5 exhibits improvements in vision, mathematics, and research tasks. Through features like effort control on the Claude API, developers can now tailor performance output based on time and resource tradeoffs, while using significantly fewer tokens. Set to high effort, the model beats its predecessor's scores using less than half the output data volume. Security and safety are also central to the release. Opus 4.5 features improved resistance to prompt injection attacks and lower rates of "concerning behavior" across misalignment evaluations, positioning it as Anthropic's safest release to date. 
The company cited advanced safety evaluations and ongoing research under its Societal Impacts and Economic Futures program aimed at monitoring broader changes driven by AI. Enhanced integration across tools and platforms accompanies the model's launch, including updates to Claude Code, Claude for Excel, Chrome, and the developer platform. Users can now conduct multi-agent workflows in the desktop app or coordinate extended research tasks using Claude memory and subagent teams.
[28]
Anthropic Introduces Opus 4.5 : Handles Agent Tasks & Heavy Engineering with Higher Accuracy
What if the tools you rely on could think faster, adapt smarter, and solve problems with unprecedented precision? Enter Claude Opus 4.5, the latest breakthrough in artificial intelligence from Anthropic. This isn't just another AI upgrade, it's a model designed to redefine how industries approach complexity. Imagine debugging intricate code in seconds, analyzing massive datasets with ease, or even optimizing engineering workflows that once took weeks. With its seamless integration across major cloud platforms like AWS and Google Cloud, Claude Opus 4.5 is poised to become an indispensable ally for professionals navigating today's fast-paced, data-driven world. In this introduction, Anthropic explains how Claude Opus 4.5 transforms workflows across fields ranging from software development to healthcare. You'll discover its advanced coding capabilities, innovative problem-solving skills, and its ability to enhance decision-making with data-driven precision. But that's just the beginning, this AI model doesn't merely assist; it enables. Whether you're a front-end developer designing user-centric solutions or an engineer tackling resource-intensive simulations, Claude Opus 4.5 offers tools to amplify your potential. As we unpack its features, consider how this innovation might reshape not just your tasks, but the very way you approach challenges. Claude Opus 4.5 distinguishes itself with its advanced coding capabilities, making it an invaluable asset for developers. It can generate, debug, and optimize code with remarkable accuracy, significantly reducing the time spent on repetitive or error-prone tasks. For instance, the model excels at identifying and resolving complex bugs, a process that traditionally demands extensive manual effort. By automating these steps, Claude Opus 4.5 allows developers to focus on innovation and higher-level problem-solving. Beyond coding, the model demonstrates exceptional problem-solving skills across a wide range of domains. 
Its ability to analyze intricate scenarios and deliver actionable insights makes it a powerful tool for addressing challenges in fields such as engineering, data science, and technical research. Whether you're designing algorithms, troubleshooting systems, or interpreting large datasets, Claude Opus 4.5 provides the analytical support needed to achieve precise and efficient outcomes. Claude Opus 4.5 is designed to streamline decision-making processes by using advanced algorithms to evaluate multiple variables and potential outcomes. This capability is particularly valuable in business environments, where strategic decisions often involve balancing competing priorities. By delivering clear, data-driven recommendations, the model enables professionals to make informed choices with greater confidence. Its utility extends beyond strategic planning to routine operational tasks. For example, Claude Opus 4.5 can automate spreadsheet management, reducing manual effort and minimizing the risk of errors. Whether you're organizing data, performing complex calculations, or generating detailed reports, the model ensures accuracy while saving valuable time. This combination of strategic and operational support makes it an indispensable tool for improving productivity and efficiency across industries. Engineers benefit significantly from the computational power and precision of Claude Opus 4.5. The model is capable of handling resource-intensive tasks such as system design, simulation of real-world scenarios, and optimization of engineering processes. By accelerating these workflows, it enables engineers to achieve results that were previously unattainable within traditional timeframes. This enhanced productivity fosters innovation and allows teams to tackle more ambitious projects. Front-end developers also gain a competitive edge with Claude Opus 4.5. 
The model assists in creating intuitive user interfaces, optimizing design elements, and improving overall usability. By integrating into development workflows, it helps developers deliver polished, high-quality products that align with user expectations. From enhancing visual aesthetics to ensuring functional reliability, Claude Opus 4.5 supports the creation of user-centric solutions. Claude Opus 4.5 extends its capabilities to vision-related tasks, offering advanced processing and interpretation of visual data. Its high accuracy in image recognition, object detection, and video analysis makes it a valuable tool for industries such as healthcare, manufacturing, and security. For example, in healthcare, the model can assist in analyzing medical images to support diagnostic accuracy. In manufacturing, it can enhance quality control processes by identifying defects in real time. Similarly, in security applications, it can improve surveillance systems by detecting anomalies or potential threats with precision. These vision-related functionalities not only enhance operational efficiency but also contribute to improved reliability in critical applications. By using Claude Opus 4.5, organizations can achieve higher standards of performance and accuracy in tasks that rely on visual data. Claude Opus 4.5 is designed with accessibility in mind, offering integration with all major cloud platforms. This compatibility eliminates the need for significant infrastructure investments, making the model a practical solution for businesses of all sizes. Whether you're a small startup or a large enterprise, the scalability of Claude Opus 4.5 ensures that it can adapt to your specific needs. The model's cloud-based deployment also simplifies implementation, allowing organizations to quickly integrate it into their existing workflows. 
This ease of use, combined with its powerful capabilities, makes Claude Opus 4.5 an ideal choice for professionals seeking to harness the potential of AI without the complexities of traditional deployment methods. Claude Opus 4.5 represents a significant advancement in artificial intelligence, offering strong capabilities in coding, problem-solving, decision-making, and vision-related applications. Its adaptability across various tasks and industries, coupled with its integration on leading cloud platforms, ensures that professionals can use its potential to drive innovation and operational efficiency. From tackling complex engineering challenges to optimizing workflows and enhancing user experiences, Claude Opus 4.5 is a powerful tool designed to help you achieve your goals with precision and confidence.
[29]
Anthropic launches Claude Opus 4.5, an AI model tailored for professionals and complex tasks
On Monday, the startup Anthropic unveiled Claude Opus 4.5, its most advanced artificial intelligence model to date. Designed to meet the needs of developers and knowledge professionals, this model aims to optimize complex tasks such as coding, financial analysis and data management. It is Anthropic's third major launch in two months, illustrating the intensifying competition in the generative AI sector. Claude Opus 4.5 now becomes the default model for the company's Pro, Max and Enterprise plans. Anthropic says that the model outperforms its competitors, including Google's Gemini 3 Pro and OpenAI's GPT-5.1, on benchmarks such as SWE-bench Verified, notably in "agentic" coding. It is also said to have achieved higher scores than human candidates in an internal evaluation test for engineers. More broadly, Claude Opus 4.5 shows improved performance in handling tables, creating presentations and conducting in-depth research, confirming its premium positioning within the Claude range, which also includes the Haiku and Sonnet models.
[30]
Anthropic bolsters AI model Claude's coding, agentic abilities with Opus 4.5
(Reuters) -Artificial intelligence startup Anthropic unveiled an upgraded Opus model on Monday, boosting Claude's ability to write detailed code, create sophisticated agents and streamline enterprise workflows through spreadsheet and financial analysis. The new model comes as Amazon and Alphabet-backed Anthropic races against OpenAI and other rivals to develop cutting-edge large language models aimed at achieving capabilities that could surpass human intelligence. Opus 4.5 ranks among the most powerful models in the Claude family, offering deep reasoning and memory, coding and a versatile performance across a range of computer applications, including financial tasks such as modeling and forecasting. Its agents autonomously refine their own capabilities and store insights from past work to apply at a later date, Anthropic said. (Reporting by Zaheer Kachwala in Bengaluru; Editing by Shilpi Majumdar)
[31]
Five big upgrades in Claude Opus 4.5 you should know about
Opus 4.5 upgrades boost productivity across writing, research and engineering tasks

Anthropic's newest flagship model arrives at a moment when users expect more than raw capability from AI. Stability, efficiency and deeper task understanding matter just as much as speed. Claude Opus 4.5 attempts to meet those expectations with a set of upgrades that feel geared toward real workflows rather than eye-catching demos. The result is a model that aims to be a reliable partner for writing, research, coding and complex problem-solving. Reasoning has always been a core benchmark for large models, and Opus 4.5 shows a noticeable step up. It handles layered logic, cross-domain comparisons and multi-part instructions with better consistency. Writers building long arguments will find that the model sticks to the brief instead of drifting midway. Researchers can push for more detailed synthesis without losing structure. The upgrade stands out most clearly in tasks that need patience: step-by-step breakdowns, investigative summaries or extended planning documents. One of the clearest wins in this update comes from stronger engineering performance. Anthropic reports improvements across internal coding benchmarks, and that uplift translates smoothly into real usage. Code suggestions follow conventions more faithfully, functions are cleaner, and the model is noticeably better at spotting logical errors. It also avoids unnecessary explanations, which helps reduce token usage. Whether someone is drafting a utility script, exploring API behaviour or reviewing project code, Opus 4.5 feels more grounded and less prone to runaway improvisation. The model now produces more accurate and contextual responses while using fewer tokens. 
This makes a difference for anyone working with longer documents, high-frequency prompts or automated systems. Teams training the model on repetitive tasks will notice better clarity without pushing token budgets too high. For individuals writing long chapters or compiling detailed research notes, the upgrade offers steady savings without compromising depth. While raw model capability matters, how it fits into daily tools is just as important. Opus 4.5 benefits from better integration across spreadsheets, desktop environments and browsers. Data-heavy tasks, such as cleaning financial sheets or interpreting multi-column tables, feel smoother. Research workflows that jump between tabs, documents and visual sources become easier to manage with the model's improved context handling. These subtle ecosystem tweaks make the model feel more present across the full arc of a task, not just in isolated queries. Consistency has become a major demand from users who work with AI for client content, editorial pieces or sensitive communication. Opus 4.5 strengthens alignment and reduces prompt-injection vulnerabilities. This results in steadier tone, fewer contradictions and less drift across long sessions. For those who rely on the model for extended writing projects, documentation or journalistic analysis, the stability upgrade is essential. It allows conversations to stretch farther without constant course correction. Claude Opus 4.5 doesn't reinvent the model. Instead, it tightens everything around the core experience: the reasoning is more dependable, the code is cleaner, the interactions cost less, the integrations reduce friction and the overall behaviour feels steadier. In a landscape filled with fast model releases, Opus 4.5 stands out for being a grounded, practical upgrade aimed squarely at people who use AI as part of their everyday creative and analytical routine.
[32]
Anthropic introduces Claude Opus 4.5, its most capable model yet, that can beat Gemini 3 Pro in coding
Anthropic also rolled out upgrades to the Claude Developer Platform, including improved Claude Code and higher usage limits.

After Google, Anthropic has unveiled its most advanced model yet, featuring advancements in software engineering and agentic AI. The Claude Opus 4.5 model is now available through Anthropic's apps, API, and all major cloud platforms, priced at $5 per million input tokens and $25 per million output tokens. In its blog post, the company stated that Opus 4.5 delivers best-in-class performance on real-world coding evaluations, including SWE-bench Verified, where it outscored all competing frontier systems, notably surpassing Google's Gemini 3 Pro in software engineering tasks. It added that Opus 4.5 solved complex, multi-system bugs more reliably than its predecessors and competitors, often completing challenges that were previously out of reach for Sonnet 4.5. Anthropic also stated that the model got the highest score ever recorded on its internal engineering take-home exam within the two-hour limit, exceeding any human candidate evaluated so far. The model also demonstrated unusually strong agentic reasoning, with testers highlighting its ability to find creative paths through multi-step tasks. Anthropic also claimed that Opus 4.5 is its most robustly aligned system to date, showing the strongest resistance among frontier models to sophisticated prompt-injection attacks. The company also announced upgrades to the Claude Developer Platform, including a new "effort" control for adjusting depth of reasoning, improvements to Claude Code, expanded desktop support, and better long-context handling across the Claude app, Chrome extension, and Excel integration. The company has also increased the usage limits for Opus 4.5 for Max and Team Premium users. "Alongside Opus, we're releasing updates to the Claude Developer Platform, Claude Code, and our consumer apps. 
There are new tools for longer-running agents and new ways to use Claude in Excel, Chrome, and on desktop," the company stated in its blog post.
Anthropic releases its most powerful AI model Claude Opus 4.5 with a 67% price reduction, targeting coding and office work applications. The launch follows recent model releases from Google and OpenAI, intensifying competition in the enterprise AI market.

Anthropic announced the release of Claude Opus 4.5 on Monday, marking a significant milestone in the company's AI development strategy with a substantial 67% price reduction that repositions its flagship model from a premium offering to a production-ready enterprise solution [3]. The new pricing structure sets input tokens at $5 per million and output tokens at $25 per million, down from the previous $15 and $75 respectively, bringing Anthropic closer to competitors while maintaining its premium market position [3].

Claude Opus 4.5 represents a strategic shift toward workplace productivity, specifically designed for coding and office work rather than content generation [1]. The model excels at producing documents, spreadsheets, and presentations while automating routine office tasks through computer and browser interaction capabilities [1]. This functionality extends to Claude for Chrome, a browser extension that enables Max users to delegate internet-based tasks to the AI system [1].

The launch occurs amid intense competition in the enterprise AI sector, following Google's Gemini 3 release last week and OpenAI's GPT-5.1 launch two weeks prior [3]. Despite the price reduction, Anthropic maintains a premium position compared to competitors, with OpenAI's GPT-5.1 priced at $1.25 per million input tokens and $10 per million output tokens, and Google's Gemini 3 Pro ranging from $2 to $4 per million input tokens [3].

As an advanced reasoning model, Claude Opus 4.5 employs sophisticated processing techniques that involve rerunning and refining operations to deliver more accurate and complete responses [1]. This approach makes the model particularly effective for complex programming projects and intensive research tasks, though it requires more computational resources and time compared to standard language models [1]. The model demonstrates significant improvements in vision, reasoning, and mathematics compared to its predecessors [5].
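The pricing figures quoted in this summary can be checked with a short calculation. The sketch below uses only the per-million-token prices stated above ($15/$75 before, $5/$25 now, and $1.25/$10 for GPT-5.1); the workload of one million input tokens and one million output tokens, along with all function and variable names, is a hypothetical chosen for illustration.

```python
def percent_reduction(old_price: float, new_price: float) -> float:
    """Return the price drop from old_price to new_price as a percentage."""
    return (old_price - new_price) / old_price * 100.0


# (input, output) prices in US dollars per million tokens, as quoted above.
PRICES = {
    "Opus, previous pricing": (15.00, 75.00),
    "Claude Opus 4.5": (5.00, 25.00),
    "GPT-5.1": (1.25, 10.00),
}


def workload_cost(model: str, input_mtok: float, output_mtok: float) -> float:
    """Cost in dollars for a workload measured in millions of tokens."""
    input_price, output_price = PRICES[model]
    return input_mtok * input_price + output_mtok * output_price


# The same ratio applies to input and output prices.
print(f"input price cut:  {percent_reduction(15.00, 5.00):.1f}%")
print(f"output price cut: {percent_reduction(75.00, 25.00):.1f}%")

# Hypothetical workload: 1M input tokens and 1M output tokens.
for model in PRICES:
    print(f"{model}: ${workload_cost(model, 1.0, 1.0):.2f}")
```

Both cuts come out to roughly 66.7%, which rounds to the 67% headline figure, while the same hypothetical workload still costs more on Opus 4.5 than on GPT-5.1 at the quoted prices.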
Claude Opus 4.5 is immediately available across multiple platforms, including Anthropic's consumer applications, API access, and all three major cloud providers: Amazon Web Services, Google Cloud Platform, and Microsoft Azure [5]. The model serves as the default option for Pro subscribers (starting at $17/month), Max subscribers (starting at $100/month), and Enterprise users [1]. This release completes Anthropic's 4.5 generation lineup, following the September launch of Sonnet 4.5 and October's Haiku 4.5 release [1].

The model launch coincides with significant corporate developments for Anthropic, including recent multi-billion-dollar investments from Microsoft and Nvidia that have elevated the company's valuation to approximately $350 billion [4]. Founded in 2021 by former OpenAI researchers and executives, Anthropic has established itself as a major player in the enterprise AI market [4]. Scott White, product leader for Claude.ai, expressed enthusiasm about the rapid development pace and market feedback loops generated by the company's frequent model releases [4].
.Summarized by
Navi