OpenAI launches GPT Image 1.5 as AI image generator war with Google intensifies

Reviewed byNidhi Govil

17 Sources

Share

OpenAI released GPT Image 1.5, a new image generation model that generates images up to four times faster than its predecessor and costs about 20 percent less through the API. The release comes as OpenAI responds to Google's viral Nano Banana Pro model, with CEO Sam Altman having declared a 'code red' after Gemini 3's success. The new model promises better instruction-following and photorealistic image manipulation through a dedicated creative studio interface in ChatGPT.

OpenAI Releases GPT Image 1.5 to Counter Google's Lead

OpenAI rolled out GPT Image 1.5 on Tuesday, marking its latest escalation in the intensifying AI image generation market competition with Google. The new image generation model generates images up to four times faster than its predecessor while costing approximately 20 percent less through the API

1

2

. Available immediately to all ChatGPT users globally and via API, the release represents OpenAI's direct response to Google's Nano Banana Pro, which went viral after its August release and has maintained dominance on the LMArena leaderboard across multiple benchmarks

2

3

.

Source: Beebom

Source: Beebom

The timing reflects OpenAI CEO Sam Altman's internal code red declaration last month, which detailed plans to regain the company's position as AI leader after Google began capturing market share with Gemini 3 and Google's Nano Banana image models

2

5

. OpenAI had reportedly planned to release the update in early January but accelerated the launch following competitive pressure. This marks the company's first image model release since GPT-Image-1 launched in April

2

.

Native Multimodal Architecture Enables Photorealistic Image Manipulation

GPT Image 1.5 distinguishes itself as a native multimodal AI image generator, meaning image synthesis happens inside the same neural network that processes language prompts

1

. This contrasts with DALL-E 3, OpenAI's earlier image generator previously built into ChatGPT, which used diffusion techniques. The newer architecture treats images and text as identical data types called tokens to be predicted, processing uploaded photos and text prompts in a unified space before outputting new pixels the same way it would generate the next word in a sentence

1

.

This technical approach enables improved instruction following and more precise photo editing features. Users can now change someone's pose or position, render scenes from different angles, remove objects, adjust clothing, and refine specific areas while preserving facial likeness across successive edits

1

3

. The model also promises better generation of legible text in images, a notoriously difficult task where even the first generation fell short

3

5

.

Source: Axios

Source: Axios

Creative Studio Interface Transforms ChatGPT Experience

Fidji Simo, OpenAI's CEO of applications, explained that ChatGPT's chat interface was never designed for visual work. "Creating and editing images is a different kind of task and deserves a space built for visuals," Simo wrote

1

3

. OpenAI introduced a dedicated Images tab in the ChatGPT sidebar that functions more like a creative studio interface, featuring preset filters, trending prompts, and specialized viewing and editing screens

2

4

.

Source: Gadgets 360

Source: Gadgets 360

The interface addresses a critical weakness in most GenAI image tools: poor iteration capabilities. When asked for specific changes like adjusting facial expressions or modifying lighting, earlier models would often reinterpret the entire image, leading to inconsistent results

2

. GPT Image 1.5 provides more granular edit controls to maintain visual consistency, including facial likeness, lighting, composition, and color tone across edits

2

. Users can now converse with the AI model about photographs, refining and revising iteratively the same way they might workshop an email draft in ChatGPT

1

.

Enterprise Focus and Disney Partnership Signal Production-Ready Shift

OpenAI positions the new model as especially useful for enterprise users, part of its push to turn a profit under investor pressure

4

. The company describes it as "a shift from novelty image generation to practical, high-fidelity visual creation," turning ChatGPT into a fast, flexible creative studio for everyday edits, expressive transformations, and real-world use

4

. Simo emphasized that search queries will display more visuals with clear sources, helpful for tasks like converting measurements or checking sports scores

2

.

The release comes one week after Disney and OpenAI struck an agreement to bring over 200 Disney characters to ChatGPT images and Sora AI videos, set to launch in early 2026

3

5

. This partnership signals how image and video generators are advancing beyond prototypes toward production-ready capabilities with commercial applications.

Societal Implications and Ethical Debates Intensify

The capability to seamlessly manipulate photographs raises significant concerns about misuse potential. For most of photography's roughly 200-year history, altering a photo convincingly required darkroom skills, Photoshop expertise, or at minimum a steady hand with scissors and glue. OpenAI's release reduces the process to typing a sentence

1

. Barriers to realistic photo editing keep dropping, with GPT Image 1.5 removing yet more friction from photorealistic image manipulation

1

.

This seamless manipulation may prompt a cultural recalibration of what visual images mean to society. For most of photography's history, convincing forgery required skill, time, and resources—barriers that made fakery rare enough that photographs could serve as reasonable proxies for truth

1

. That era has ended due to AI, with Google's Nano Banana Pro already renewing fears that identifying AI-generated content grows harder

3

5

.

Since March, creators including authors, writers, and actors have spoken out about dangers of AI tools using human-created artistic works and human likenesses to create AI content. OpenAI's video generator Sora inflamed these ethical debates this fall

5

. The first generation of OpenAI's image model sparked trends like "Studio Ghibli" versions of users, reigniting debate about ethics and legality of AI creative tools, particularly given Hayao Miyazaki's statement that AI tools are "an insult to life itself"

3

5

.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo