OpenAI launches GPT-5.4 mini and nano models built for speed over raw power

Reviewed byNidhi Govil

13 Sources

Share

OpenAI released GPT-5.4 mini and GPT-5.4 nano, marking a shift from power to speed in AI development. The mini model runs more than twice as fast as its predecessor while approaching flagship GPT-5.4 performance on coding benchmarks. Meanwhile, nano targets high-volume tasks like data classification at just $0.20 per million input tokens, enabling developers to build efficient multi-model workflows.

News article

OpenAI shifts focus to speed with GPT-5.4 mini and nano release

OpenAI released GPT-5.4 mini and GPT-5.4 nano on Tuesday, introducing faster AI models designed for high-volume AI workloads where speed matters more than raw computational power

1

. The launch represents a strategic pivot toward efficiency as the company battles Anthropic for dominance in the AI software engineering market

1

.

GPT-5.4 mini runs more than twice as fast as GPT-5 mini while delivering near flagship performance across coding, improved reasoning and tool use, and multimodal understanding

1

3

. On SWE-Bench Pro, the mini model scores 54.4 percent compared to 57.7 percent for the full GPT-5.4, while on OSWorld-Verified it reaches 72.1 percent versus 75 percent for the larger version

5

. These benchmarks demonstrate that developers can achieve strong performance without the expense of flagship models.

Lower cost AI enables new development strategies

The pricing structure makes these models particularly attractive for developers managing budget constraints. GPT-5.4 mini costs $0.75 per million input tokens and $4.50 per million output tokens, while GPT-5.4 nano comes in at $0.20 and $1.25 respectively

5

. Both models support text and image inputs, function calling, and a 400,000 token context window, ensuring core capabilities remain intact despite the lower price point

5

.

In Codex, OpenAI's coding software, the mini model uses just 30 percent of the GPT-5.4 quota, allowing developers to shift routine AI for coding tasks to a cheaper tier while reserving the full model for complex reasoning

5

. This approach directly challenges Anthropic's Claude Code, which gained attention for its ability to create applications from scratch

1

.

Subagents and multi-model workflows reshape AI architecture

OpenAI envisions these models powering subagents within larger agentic workflows, where a powerful model like GPT-5.4 handles planning and coordination while smaller models execute specific tasks

2

4

. OpenAI suggests GPT-5.4 mini excels at editing and debugging code, while GPT-5.4 nano handles data classification and extraction tasks

1

.

According to Aabhas Sharma, CTO at Hebbia, "GPT-5.4 mini delivers strong end-to-end performance for a model in this class. In our evaluations, it matched or exceeded competitive models on several output tasks and citation recall at a much lower cost"

2

. Abhisek Modi, AI engineering lead at Notion, noted that the mini model "matched and often exceeded GPT-5.2 on handling complex formatting at a fraction of the compute"

2

.

Access and availability across ChatGPT and API

GPT-5.4 mini is available for developers through the API and through Codex and ChatGPT

1

. ChatGPT Free and Go users can access it through the "Thinking" feature, while paid users will encounter it as a fallback model when they hit the rate limit for GPT-5.4 Thinking

1

3

. GPT-5.4 nano remains exclusive to the API, targeting teams running high-volume tasks where efficiency and cost control are critical

3

5

.

These models are built for workloads where latency directly shapes the product experience, including coding assistants that need to feel responsive, computer-using systems that capture and interpret screenshots, and multimodal applications that can reason over images in real-time

2

. As AI systems evolve from single powerful models to coordinated teams of specialized models, users may never directly select these options but will notice faster responses, more reliable performance, and seamless integration across the tools they use daily

4

.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo