OpenAI's Experimental Model Achieves Gold Medal Performance at International Math Olympiad

Reviewed byNidhi Govil

2 Sources

OpenAI's latest experimental AI model has demonstrated gold medal-level performance at the 2025 International Math Olympiad, solving 5 out of 6 problems and scoring 35 out of 42 points. This achievement marks a significant milestone in AI's reasoning capabilities.

OpenAI's Experimental Model Achieves Gold Medal Performance at IMO

In a groundbreaking development for artificial intelligence, OpenAI has announced that its experimental model has achieved "gold medal-level performance" at the 2025 International Math Olympiad (IMO). This achievement marks a significant milestone in AI's ability to tackle complex mathematical problems requiring creative reasoning 1.

Source: Analytics India Magazine

Source: Analytics India Magazine

The Model's Performance

The unreleased AI model successfully solved five out of six problems at the IMO, earning an impressive 35 out of 42 points. This performance places it on par with the top 10% of human contestants who typically receive gold medals in this prestigious competition 1.

Alexander Wei, a research scientist at OpenAI, emphasized the significance of this accomplishment, stating that the model can now "craft intricate, watertight arguments at the level of human mathematicians" 1.

Evaluation Process and Conditions

The AI model was evaluated under the same rigorous conditions as human participants:

  1. Two 4.5-hour sessions
  2. No access to external tools or the internet
  3. Required to write detailed proofs based on official IMO problems

Three former IMO medalists independently graded each solution, with final scores based on unanimous agreement 2.

Implications for AI Development

This achievement represents a significant leap in AI's reasoning capabilities. Wei contextualized the progress by noting the progression of reasoning benchmarks:

"We've now progressed from GSM8K (~0.1 min for top humans) → MATH benchmark (~1 min) → AIME (~10 mins) → IMO (~100 mins)" 2.

The success at the IMO demonstrates advancements in "general-purpose reinforcement learning and test-time compute scaling" [2](https://analyticsindiamag.com/ai-news-updates/openais-reasoning-model-wins-gold-at-2025-imo-gpt-5-coming-soon()].

Future Releases and GPT-5

While this experimental model showcases impressive capabilities, OpenAI does not plan to release anything with this level of math capability for several months. The upcoming GPT-5, which is expected to be released soon, will likely be an improvement from its predecessor but won't feature the same level of mathematical prowess as the IMO-winning model 1 2.

Source: engadget

Source: engadget

Comparison to Previous AI Achievements

This accomplishment surpasses previous AI performances in mathematical competitions. Last year, Google DeepMind's AlphaProof and AlphaGeometry 2 solved four out of six problems from the IMO, achieving a score equivalent to a silver medalist 2.

OpenAI's achievement at the IMO underscores the rapid advancement of AI in recent years, surpassing earlier predictions and setting new benchmarks for machine intelligence in complex problem-solving tasks.

Explore today's top stories

Meta's Bold Leap: Zuckerberg's $100 Billion Gamble on AI Superintelligence

Meta, under Mark Zuckerberg's leadership, is making a massive investment in AI, aiming to develop "superintelligence" with a new elite team and billions in infrastructure spending.

The Atlantic logoThe Motley Fool logo

2 Sources

Technology

22 hrs ago

Meta's Bold Leap: Zuckerberg's $100 Billion Gamble on AI

The Dos and Don'ts of AI Chatbot Usage: Navigating the Ethical and Practical Boundaries

As AI chatbots like ChatGPT gain popularity, users must be aware of their limitations and potential risks. This article explores scenarios where using AI chatbots may be inappropriate or dangerous, emphasizing the importance of responsible AI usage.

CNET logoMashable logo

2 Sources

Technology

22 hrs ago

The Dos and Don'ts of AI Chatbot Usage: Navigating the

Nvidia Faces Supply Challenges in Resuming AI Chip Sales to China

Nvidia encounters production obstacles for its H20 AI chips intended for the Chinese market, despite plans to resume sales amid U.S. export restrictions.

Reuters logoMarket Screener logo

2 Sources

Business and Economy

14 hrs ago

Nvidia Faces Supply Challenges in Resuming AI Chip Sales to

AI Data Centers Strain Local Water Resources, Raising Environmental Concerns

Meta's data center in Newton County, Georgia, is linked to water scarcity issues, highlighting the environmental impact of AI infrastructure on local communities.

Futurism logoEconomic Times logo

2 Sources

Technology

14 hrs ago

AI Data Centers Strain Local Water Resources, Raising

Gabe Newell Predicts AI Tools Will Reshape Game Development Landscape

Valve co-founder Gabe Newell discusses the potential impact of AI on game development, suggesting that AI tools could make non-programmers more effective than experienced developers in creating value.

pcgamer logoTweakTown logogamesradar logo

3 Sources

Technology

1 day ago

Gabe Newell Predicts AI Tools Will Reshape Game Development
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo