OpenAI Releases GPT-5.4, New AI Model Built for Agents and Professional Work

Reviewed byNidhi Govil

30 Sources

Share

OpenAI launched GPT-5.4 Thinking and Pro models on Thursday, designed specifically for AI agents and enterprise applications. The company claims the new AI model delivers 33% fewer false claims and matches human professionals 83% of the time across 44 occupations. The release comes amid growing competition with Anthropic's Claude and controversy over OpenAI's Pentagon deal.

OpenAI Unveils GPT-5.4 as Competition with Anthropic Intensifies

OpenAI released its latest AI model, GPT-5.4, on Thursday, just two days after launching GPT-5.3 Instant. The company describes GPT-5.4 as its "most capable and efficient frontier model for complex professional work," bringing together recent advancements in reasoning and coding alongside agentic workflows into a unified system

1

. Available as GPT 5.4 Thinking for ChatGPT subscribers and GPT 5.4 Pro through the API, the model is specifically designed to support AI agents—autonomous systems that can operate independently with minimal human intervention

3

.

Source: Geeky Gadgets

Source: Geeky Gadgets

The release positions OpenAI directly against Anthropic and its Claude models, particularly as both companies compete for enterprise applications and business professionals willing to pay monthly subscriptions

1

. Recent reports indicate Anthropic's popularity has surged, with Claude mobile apps claiming top spots in Apple's and Google's app stores, while online forums fill with advice on transferring data from ChatGPT to Claude

1

.

Fewer Errors and Enhanced Capabilities for Professional Work

OpenAI calls GPT-5.4 its "most factual model yet," addressing ongoing concerns about AI hallucinations where models generate false information. According to OpenAI's benchmarks, responses from GPT-5.4 are 18% less likely to contain errors, while individual claims are 33% less likely to be false compared to GPT-5.2

1

3

. The company emphasizes that users should still fact-check AI-generated content despite these improvements.

Source: TechRadar

Source: TechRadar

Perhaps most striking, OpenAI's testing reveals that GPT-5.4 can match or outperform human professionals 83% of the time across nine industries and 44 real-world occupations

2

. The company introduced GPTval in September, an evaluation test measuring AI performance on "economically valuable, real-world tasks" in industries contributing at least 5% to US gross domestic product

2

. On the OSWorld-Verified benchmark, which monitors AI's ability to navigate desktop environments, GPT-5.4 scored 75%, up from 47.3% with GPT-5.2 and exceeding the average human result of 72.4%

3

.

Native Computer Control Powers Autonomous Agents Forward

One of GPT-5.4's most significant updates involves its ability to use native computer resources, enabling autonomous agents to complete complex tasks across multiple applications. The model can write code to operate computers, responding to mouse and keyboard commands based on screenshot analysis

3

. This capability allows developers to build agents that operate other services with limited human interaction, marking a substantial step toward fully autonomous systems

3

.

OpenAI emphasizes that GPT-5.4 can more efficiently support agentic activity, using less computing power and therefore costing less money than previous models

1

. The model also allows users to adjust answers mid-response while generating, enabling course corrections without starting fresh—a feature immediately available on Android and ChatGPT's website, with iPhone support coming soon

3

.

Financial-Services Tools Target Enterprise Market

OpenAI announced a new suite of financial-services tools alongside GPT-5.4, designed to help professionals streamline financial analysis, investment memos, and other specialized work

4

. The product connects with ChatGPT apps from financial data firms like FactSet Research Systems Inc. and Third Bridge, while also enabling direct ChatGPT use in Microsoft Excel and Google Sheets for creating and examining financial models

4

.

Internal testing found that spreadsheets generated to emulate a junior investment banking analyst achieved a mean success rate of 87.3% with human raters

3

. The model demonstrates improved capabilities in generating AI-powered spreadsheets, documents, and presentations, requiring less back-and-forth interaction with users

4

. OpenAI also introduced ChatGPT for Excel as a dedicated tool to help users run scenarios and generate outputs based on cells and formulas

3

.

Pentagon Deal Controversy Shadows Launch

The GPT-5.4 release arrives amid controversy surrounding OpenAI's relationship with the Department of Defense. After Anthropic was declared a "supply-chain risk" by the Pentagon following its refusal to allow AI use for mass surveillance of Americans or fully autonomous weapon systems, OpenAI struck a $200 million deal with the defense department in 2025

1

4

.

Source: Digit

Source: Digit

Sam Altman later acknowledged that OpenAI's rush to forge the Pentagon agreement looked "opportunistic and sloppy," stating the company was working to "make some additions in our agreement to make our principles very clear"

4

. Altman clarified that safeguards would be implemented and that the technology wouldn't be made available to intelligence agencies like the NSA, though significant questions remain about how AI is being used by government agencies and defense contractors

1

. Anthropic has reportedly resumed talks with the Pentagon following the initial breakdown

4

.

Availability and Rollout Details

GPT 5.4 Thinking is rolling out now for Plus, Pro, and Team subscribers through ChatGPT, replacing the GPT 5.2 Thinking model

3

. The previous version will be moved to Legacy Models before removal on June 5

3

. GPT 5.4 Pro is available through the API for Pro and Enterprise plans, as well as for ChatGPT Enterprise and Edu subscribers

5

. The model is also available in Codex, OpenAI's coding application, with API access beginning Friday

1

2

. No announcement has been made regarding availability for free users.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo