OpenAI Unveils Advanced AI Reasoning Models o3 and o4-mini with Enhanced Visual and Tool Integration

15 Sources

OpenAI launches o3 and o4-mini, new AI reasoning models with improved performance in coding, math, science, and visual understanding. These models can integrate images into their reasoning process and use ChatGPT tools independently.

News article

OpenAI Introduces Advanced AI Reasoning Models

OpenAI has unveiled its latest AI reasoning models, o3 and o4-mini, marking a significant advancement in artificial intelligence technology. Announced on Thursday, these new models are designed to pause and work through questions before responding, offering improved performance across various domains 1.

Enhanced Capabilities and Performance

The o3 model is touted as OpenAI's most advanced reasoning model to date, outperforming previous iterations in tests measuring math, coding, reasoning, science, and visual understanding capabilities. Meanwhile, o4-mini provides a competitive balance between price, speed, and performance, catering to developers' needs when choosing an AI model for their applications 1.

Integration with ChatGPT Tools

A key feature of o3 and o4-mini is their ability to generate responses using various ChatGPT tools, including:

  • Web browsing
  • Python code execution
  • Image processing
  • Image generation

This integration allows the models to tackle complex, multi-step problems more effectively 1.

Visual Understanding and "Thinking with Images"

OpenAI claims that o3 and o4-mini are their first models capable of "thinking with images." This means users can upload images such as whiteboard sketches or diagrams, which the models will analyze during their "chain-of-thought" phase before answering. The models can understand blurry and low-quality images and perform tasks like zooming or rotating images as part of their reasoning process 2.

Availability and Access

The new models are available to subscribers of OpenAI's Pro, Plus, and Team plans. They can be accessed through:

  1. ChatGPT interface
  2. OpenAI's developer-facing endpoints (Chat Completions API and Responses API)

OpenAI plans to release o3-pro, a more powerful version of o3, in the coming weeks exclusively for ChatGPT Pro subscribers 1.

Codex CLI: A New Open-Source Agent

Alongside the new models, OpenAI has launched Codex CLI, an open-source coding agent that runs locally in users' terminals. This tool provides a simple way to connect AI models, including o3 and o4-mini, to users' own code and tasks running on their computers 5.

Implications for the AI Industry

The release of o3 and o4-mini is part of OpenAI's ongoing effort to maintain its competitive edge in the global AI race against companies like Google, Meta, xAI, Anthropic, and DeepSeek. These new models represent a shift towards AI systems that can perform more complex reasoning tasks, potentially opening up new applications in scientific research, business strategy, and problem-solving across various industries 3.

Future Developments

OpenAI CEO Sam Altman has indicated that o3 and o4-mini may be the company's last standalone AI reasoning models in ChatGPT before the release of GPT-5, which is expected to unify traditional models like GPT-4.1 with its reasoning models 1. This suggests a continued evolution in AI technology, with potential implications for various sectors and applications.

Explore today's top stories

Google Unveils Pixel 10 Series: AI-Powered Features and Camera Upgrades Take Center Stage

Google has launched its new Pixel 10 series, featuring improved AI capabilities, camera upgrades, and the new Tensor G5 chip. The lineup includes the Pixel 10, Pixel 10 Pro, and Pixel 10 Pro XL, with prices starting at $799.

Ars Technica logoTechCrunch logoCNET logo

60 Sources

Technology

16 hrs ago

Google Unveils Pixel 10 Series: AI-Powered Features and

Google Unveils AI-Powered Pixel 10 Smartphones with Advanced Gemini Features

Google launches its new Pixel 10 smartphone series, showcasing advanced AI capabilities powered by Gemini, aiming to compete with Apple in the premium handset market.

Bloomberg Business logoThe Register logoReuters logo

22 Sources

Technology

16 hrs ago

Google Unveils AI-Powered Pixel 10 Smartphones with

NASA and IBM Unveil Surya: An AI Model to Predict Solar Flares and Space Weather

NASA and IBM have developed Surya, an open-source AI model that can predict solar flares and space weather with improved accuracy, potentially helping to protect Earth's infrastructure from solar storm damage.

New Scientist logoengadget logoGizmodo logo

6 Sources

Technology

1 day ago

NASA and IBM Unveil Surya: An AI Model to Predict Solar

Google Unveils Pixel Watch 4: A Leap Forward in AI-Powered Wearables

Google's latest smartwatch, the Pixel Watch 4, introduces significant upgrades including a curved display, AI-powered features, and satellite communication capabilities, positioning it as a strong competitor in the smartwatch market.

TechCrunch logoCNET logoZDNet logo

18 Sources

Technology

15 hrs ago

Google Unveils Pixel Watch 4: A Leap Forward in AI-Powered

FieldAI Secures $405M Funding to Revolutionize Robot Intelligence with Physics-Based AI Models

FieldAI, a robotics startup, has raised $405 million to develop "foundational embodied AI models" for various robot types. The company's innovative approach integrates physics principles into AI, enabling safer and more adaptable robot operations across diverse environments.

TechCrunch logoReuters logoGeekWire logo

7 Sources

Technology

16 hrs ago

FieldAI Secures $405M Funding to Revolutionize Robot
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo