AI Models from DeepMind and OpenAI Achieve Gold Medal Performance in International Mathematical Olympiad

Reviewed byNidhi Govil

38 Sources

Share

Google DeepMind and OpenAI's AI models have achieved gold medal-level performance in the 2025 International Mathematical Olympiad (IMO), marking a significant advancement in AI's mathematical reasoning capabilities.

AI Models Achieve Gold Medal Performance in IMO

In a significant advancement for artificial intelligence, both Google DeepMind and OpenAI have announced that their AI models achieved gold medal-level performance in the 2025 International Mathematical Olympiad (IMO). This prestigious competition, running since 1959, is known for its challenging proof-based problems that test mathematical reasoning and creativity

1

3

.

DeepMind's Gemini Deep Think

Source: Entrepreneur

Source: Entrepreneur

Google DeepMind's AI model, named Gemini Deep Think, scored 35 out of 42 points on the six IMO problems, correctly solving five out of six questions

1

2

. This performance marks a significant improvement over their 2024 entry, which achieved a silver medal equivalent using specialized systems AlphaProof and AlphaGeometry 2

2

.

Thang Luong, a senior scientist at DeepMind, described this year's achievement as a "big paradigm shift"

1

. Unlike previous attempts that required human experts to translate problems into a formal language, Gemini Deep Think processed and solved problems entirely in natural language

1

2

.

OpenAI's Experimental Model

Source: Analytics India Magazine

Source: Analytics India Magazine

OpenAI also reported that its experimental AI model achieved gold medal-level performance on the IMO problems

3

. However, their announcement came earlier than expected, causing some controversy within the AI and mathematical communities

3

4

.

Both companies claim their models operated under the same time constraints as human participants: 4.5 hours per session, without internet access or calculators

1

3

.

Significance and Implications

This achievement represents a major step forward in AI's ability to handle complex mathematical reasoning. Gary Marcus, a neuroscientist and AI critic, called the results "awfully impressive," noting that solving problems at this level demonstrates "really good math problem solving chops"

1

.

The success of these general-purpose language models in tackling IMO problems suggests potential applications beyond mathematics. Researchers from both companies believe these advancements could lead to AI systems capable of addressing challenging scientific and research problems

1

5

.

Controversy and Competition

Source: Analytics India Magazine

Source: Analytics India Magazine

The announcements have not been without controversy. OpenAI's early release of their results, before the agreed-upon date of July 28, drew criticism from the IMO community and other AI companies

3

4

. Additionally, questions were raised about the self-grading of OpenAI's results, as opposed to the official IMO grading received by Google DeepMind

3

4

.

This competition between AI companies highlights the ongoing race for supremacy in the field, with implications for public perception, talent acquisition, and future development

4

.

Future Prospects and Limitations

While these results are promising, some mathematicians and researchers urge caution. Kevin Buzzard of Imperial College London noted that success in the IMO doesn't necessarily translate to readiness for advanced mathematical research

1

. Similarly, Ken Ono from the University of Virginia views AI as valuable research partners but emphasizes that these benchmarks don't fully align with the work of theoretical mathematicians

1

.

Both DeepMind and OpenAI plan to make versions of their models available to researchers in the coming months, potentially opening new avenues for collaboration between AI and human mathematicians

1

5

. However, the full impact of these advancements on mathematical research and problem-solving remains to be seen.

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo