OpenAI launches ChatGPT Images 2.0 with text rendering that finally works

Reviewed byNidhi Govil

28 Sources

Share

OpenAI released ChatGPT Images 2.0, a new AI image model that can accurately generate text in multiple languages and create complex visual content like infographics and marketing materials. The model includes thinking capabilities, web information access, and can generate up to eight images from a single prompt while maintaining visual consistency.

OpenAI Unveils ChatGPT Images 2.0 With Major Text Rendering Breakthrough

OpenAI launched ChatGPT Images 2.0 on Tuesday, marking a significant leap in image generation capabilities that addresses one of AI's most persistent challenges: creating legible, accurate text. The new AI image model can now generate restaurant menus, infographics, study guides, and marketing materials with text that looks professionally designed rather than garbled nonsense

1

. Just two years ago, asking DALL-E 3 to create a Mexican restaurant menu resulted in fictional dishes like "enchuita," "churiros," and "margartas." Now, ChatGPT Images 2.0 produces menu text clean enough for immediate commercial use

1

.

Source: SiliconANGLE

Source: SiliconANGLE

The improved text rendering extends beyond English, with robust non-Latin language support for Japanese, Korean, Hindi, Bengali, Chinese, and Gujarati. When tested with Gujarati instructions, the model delivered clear text with grammatical accuracy and natural phrasing

5

. This capability positions ChatGPT Images 2.0 as a practical tool for global businesses and educators creating multilingual content.

Thinking Capabilities Enable Complex Multi-Image Workflows

ChatGPT Images 2.0 integrates thinking capabilities that fundamentally change how the model approaches image generation. Instead of simply matching prompt details, the AI can now search the web for recent information, generate multiple images from a single prompt, and double-check its creations

1

. The model's knowledge cutoff extends to December 2025, allowing it to incorporate current information into visual outputs

2

.

Source: VentureBeat

Source: VentureBeat

When asked to "Generate an infographic about activities I should do with tomorrow's weather in San Francisco in mind," the model gathered weather data, determined appropriate activities, and created an image featuring accurate weather details alongside recognizable landmarks like the Ferry Building, Castro Theater, Painted Ladies houses, and Transamerica Pyramid

2

4

. Users can generate up to eight images at once while maintaining consistency across characters, font, color palette, and overall mood—particularly useful for creating comic book pages or social media assets in various sizes

5

.

Customizable Aspect Ratios and High-Resolution Outputs

The new model addresses a long-standing frustration with customizable aspect ratios ranging from 3:1 wide to 1:3 tall, giving users precise control over image dimensions

2

4

. Developers using the API can create high-resolution images in 2K and 4K resolution, though these higher resolutions remain in beta

3

. According to OpenAI, the model delivers "an unprecedented level of specificity and fidelity to image creation," handling small text, iconography, UI elements, dense compositions, and subtle stylistic constraints at up to 2K resolution

1

.

Source: ZDNet

Source: ZDNet

The focus on text-heavy visuals represents a strategic shift for OpenAI. Just one month after shuttering its Sora AI video app to concentrate on "core products," the company is building what it calls tools for "economically valuable creative tasks"

3

. Adele Li, product lead for ChatGPT Images, explained that "visual intelligence" is critical to ChatGPT's vision for developing a personal creative assistant

3

.

Positioning Against Competitors and Practical Limitations

ChatGPT Images 2.0 competes directly with Google's Nano Banana Pro and Nano Banana 2, which previously led in text rendering capabilities

3

5

. Unlike Midjourney's artistic focus or Adobe Firefly's professional editing tools, ChatGPT Images 2.0 targets working professionals who need attractive content quickly—teachers creating lesson plans, marketing managers producing social media posts, and businesses developing infographics

3

.

However, early testing revealed limitations. When attempting to reproduce brand logos accurately, the model struggled despite repeated instructions. In one test with the ZDNET logo, the AI either distorted elements, retrieved outdated versions, or added incorrect design features

4

. Additionally, users cannot directly edit generated images—they must regenerate them entirely, which consumes credits faster when working with complex text-heavy designs

3

.

All ChatGPT and Codex users can access ChatGPT Images 2.0 starting Tuesday, with generation limits based on subscription tiers. Advanced thinking outputs with web information access remain exclusive to ChatGPT Plus, Pro, and Business users

5

. The gpt-image-2 API is also available, with pricing dependent on quality and resolution

1

. OpenAI maintains safety measures including C2PA metadata standards for identifying AI-generated content and policies prohibiting abusive and illegal imagery

3

.

Today's Top Stories

TheOutpost.ai

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Instagram logo
LinkedIn logo
Youtube logo
© 2026 TheOutpost.AI All rights reserved