4 Sources
4 Sources
[1]
Google Says Deep Think AI Can Partner on Advanced Math, Science
Alphabet Inc. has updated its Gemini Deep Think artificial intelligence model for better performance in math and science research, the company said. After close partnership with researchers, the specialized reasoning model is now able to help scientists move from theoretical reasoning to practical applications, according to a blog post. The AI model uses Google's search to avoid inaccuracies and wrongful citations when doing research, according to a separate blog post. Gemini 3 Deep Think, as the model is called, can also help researchers in chemistry, computer science and physics. The new model is part of a push by the leading AI developers to build more advanced tools that can field everything from complex coding to scientific research. Anthropic, for example, recently released a new version of its most powerful AI model to do both financial research and legal services, leading to a market selloff for traditional software firms. Google built out a math research agent, dubbed Aletheia, that can conduct autonomous research or collaborate with humans. The new agent can also "admit failure to solve a problem," which improved efficiency for researchers, Alphabet's Google said. Google published some of the papers that resulted from the new technology. "Spanning diverse fields -- from information and complexity theory to cryptography and mechanism design -- the results demonstrate how AI is fundamentally shifting research," the company said. Google said it is available in the Gemini app for Google AI Ultra subscribers and also for select researchers, the post said.
[2]
Gemini 3 Deep Think gets 'major upgrade' aimed at practical applications
Deep Think is Gemini's "specialized reasoning mode," and Google today announced a "major upgrade" to let it "solve modern challenges across science, research, and engineering." Google worked with scientists and researchers on this update, with the goal of using Deep Think to "tackle tough research challenges" that "often lack clear guardrails or a single correct solution and data is often messy or incomplete." By blending deep scientific knowledge with everyday engineering utility, Deep Think moves beyond abstract theory to drive practical applications. In terms of benchmarks for this Gemini 3 Deep Think upgrade, Google highlights: This leap in mathematics and competitive coding is joined by boosted performance in chemistry, physics (including theoretical), and other scientific domains. Practical applications for Deep Think allow "researchers to interpret complex data, and engineers to model physical systems through code." With the updated Deep Think, you can turn a sketch into a 3D-printable reality. Deep Think analyzes the drawing, models the complex shape and generates a file to create the physical object with 3D printing. This Gemini 3 Deep Think upgrade is now available in the Gemini app for Google AI Ultra subscribers, while Google is also making it available via the Gemini API (express interest for early access here) for enterprise users.
[3]
Google's Most Intelligent AI Model Just Got Smarter
Select researchers and enterprises can access the model via API Google, on Thursday, updates its Gemini 3 Deep Think artificial intelligence (AI) model. The frontier model was already the company's most intelligent model when it was launched in December 2025. Now, with this upgrade, Google says it can help scientists research challenging problems. The Mountain View-based tech giant highlighted that the update improves its performance across all major benchmarks, but most notably, the model sets new record on the ARC-AGI-2 and Humanity's Last Exam, outperforming both OpenAI's GPT-5.2 and Anthropic's Claude Opus 4.6. Gemini 3 Deep Think Gets Upgraded In a blog post, the tech giant said it is releasing a major upgrade to Gemini 3 Deep Think which will allow it to solve modern challenges across science, research, and engineering. The model continues to be available to the Google AI Ultra subscribers, but now, a select group of researchers and enterprises can also access it via the company's application programming interface (API). Announcing the update, Google CEO Sundar Pichai said, "Gemini 3 Deep Think is getting a significant upgrade. We've refined Deep Think in close partnership with scientists and researchers to tackle tough, real-world challenges." Elon Musk called the development "Impressive," responding to the post. With the improvement, the AI model is claimed to have scored 84.6 percent on the ARC-AGI-2 benchmark, which measures the reasoning capability of frontier models. Google claimed that the score was also verified by the ARC Prize Foundation. It also set a new record by scoring 48.4 percent (without tools) on Humanity's Last Exam, known for being the most difficult benchmark test in existence. Additionally, the company claimed that Gemini 3 Deep Think also achieved Elo score of 3,455 on Codeforces. In each of these tests, the Google model is said to outperform frontier models from OpenAI and Anthropic. Google also shared how some researchers are using the AI model in real-world scientific problems. It highlighted that Lisa Carbone, a mathematician at Rutgers University, used Gemini 3 Deep Think to review a highly technical mathematics paper. She observed that the model successfully identified a subtle logical flaw that had previously passed through human peer review unnoticed.
[4]
Google Gemini 3 Deep Think hits gold medal standards in math and physics olympiads
Google has officially unveiled a major upgrade to Gemini 3 Deep Think, its most sophisticated reasoning model designed to push the boundaries of intelligence in science, research, and engineering. This release marks a transition from general-purpose AI toward a specialized tool capable of navigating the nuances of advanced academia. While standard models often struggle with "messy" data or problems that lack a single clear-cut solution, Deep Think is built to thrive in these gray areas. By blending deep scientific knowledge with algorithmic rigor, Google is positioning this model as a critical collaborator for the global scientific community. Also read: Seedance 2.0: This Chinese AI video tool is outpacing Veo 3 and Sora 2 The most striking achievement of this updated model is its unprecedented performance on the world's most difficult academic benchmarks. Gemini 3 Deep Think has achieved gold-medal level performance in the 2025 International Math Olympiad, proving it can handle the abstract logic and creative problem-solving required at the highest levels of competitive mathematics. This expertise is not limited to numbers alone; the model also demonstrated gold-medal level results on the written sections of the 2025 International Physics and Chemistry Olympiads. These results suggest that the model has moved beyond mere pattern matching and is now capable of deep, first-principles reasoning. Beyond the classroom, the model is setting new industry standards for artificial general intelligence. It recorded a staggering 84.6% on the ARC-AGI-2 benchmark, a test specifically designed to measure fluid intelligence and the ability to learn new concepts on the fly. In the realm of competitive programming, it attained an Elo rating of 3455 on Codeforces, placing it among the elite tier of human coders. Perhaps most impressively, it scored 48.4% on "Humanity's Last Exam," a benchmark composed of questions specifically designed by experts to be nearly impossible for contemporary AI to solve without specialized tools. Also read: Microsoft warning: AI being brainwashed to favour some brands The true value of Gemini 3 Deep Think is already being realized in real-world research environments where human peer review often reaches its limits. At Rutgers University, a team used the model to review a highly technical mathematics paper focusing on the intersection of Einstein's theory of gravity and quantum mechanics. In a field where training data is scarce and the logic is incredibly dense, Deep Think successfully identified a subtle logical flaw that had remained unnoticed during traditional human peer review. This ability to act as a high-level auditor for scientific literature could fundamentally change how academic research is verified and published. Further practical success was seen at Duke University's Wang Lab, where researchers utilized the model to optimize fabrication methods for semiconductor materials. The model successfully designed a precise recipe for growing thin films larger than 100 micrometers, a target that had previously eluded researchers using standard methodologies. By modeling physical systems through code and interpreting complex datasets, Deep Think is proving that its reasoning capabilities have tangible benefits for material science and industrial engineering. Google is also showcasing the model's ability to bridge the gap between abstract design and physical manufacturing. One of the most practical new features allows Deep Think to analyze a simple hand-drawn sketch and transform it into a 3D-printable object. By understanding the geometry and physical requirements of the drawing, the model generates the necessary code to create a functional file for 3D printing. This capability streamlines the prototyping process for engineers, allowing for rapid iteration from a basic concept to a physical part. The updated Gemini 3 Deep Think is now available to Google AI Ultra subscribers within the Gemini app. To ensure this technology reaches the hands of those who can use it most effectively, Google is also launching an early access program for the Gemini API. This allows enterprises, researchers, and independent engineers to integrate these deep reasoning capabilities into their own custom applications. As AI continues to evolve, Google's latest offering suggests that the next great frontier isn't just about faster answers, but about more profound, verified logic that can solve the world's most complex scientific mysteries.
Share
Share
Copy Link
Google has released a major upgrade to Gemini 3 Deep Think, its specialized AI reasoning model designed for scientific research. The update achieves record-breaking scores on benchmarks like ARC-AGI-2 (84.6%) and Humanity's Last Exam (48.4%), outperforming OpenAI's GPT-5.2 and Anthropic's Claude Opus 4.6. Real-world applications include identifying peer review flaws in mathematics papers and optimizing semiconductor fabrication methods.
Google has announced a significant Gemini 3 Deep Think upgrade that positions the AI reasoning model as a partner for solving complex problems across mathematics, chemistry, physics, and engineering. The update, developed in close collaboration with scientists and researchers, shifts the model's focus from abstract theory to practical applications in science that tackle real-world challenges where data is often messy or incomplete and problems lack clear guardrails
1
2
. Google CEO Sundar Pichai emphasized that the company refined Deep Think specifically to address tough, real-world challenges in partnership with the scientific community3
.
Source: Digit
The upgraded Gemini 3 Deep Think has established new industry standards across multiple benchmarks, demonstrating exceptional reasoning capability. The model achieved an impressive 84.6% on the ARC-AGI-2 benchmark, a score verified by the ARC Prize Foundation that measures fluid intelligence and the ability to learn new concepts
3
. On Humanity's Last Exam, considered the most difficult benchmark test in existence, it scored 48.4% without toolsโquestions specifically designed by experts to be nearly impossible for contemporary AI to solve3
4
. The model also attained an Elo rating of 3,455 on Codeforces, placing it among elite human coders3
. In each of these tests, the frontier model outperformed both OpenAI's GPT-5.2 and Anthropic's Claude Opus 4.63
.
Source: 9to5Google
Gemini 3 Deep Think has achieved gold-medal level performance in the 2025 International Math Olympiad, demonstrating it can handle abstract logic and creative problem-solving at the highest competitive levels
4
. The model also demonstrated gold-medal results on the written sections of the 2025 International Physics and Chemistry Olympiads, suggesting it has moved beyond pattern matching to deep, first-principles reasoning4
. This leap in mathematics and competitive coding is joined by boosted performance in chemistry, physics including theoretical domains, and other scientific fields2
.The true value of the upgraded model is already visible in scientific research environments. At Rutgers University, mathematician Lisa Carbone used Gemini 3 Deep Think to review a highly technical mathematics paper focusing on the intersection of Einstein's theory of gravity and quantum mechanics
3
4
. In a field where training data is scarce and logic incredibly dense, the model successfully identified a subtle logical flaw that had remained unnoticed during traditional human peer review3
. At Duke University's Wang Lab, researchers utilized the model to optimize fabrication methods for semiconductor materials, successfully designing a precise recipe for growing thin films larger than 100 micrometersโa target that had previously eluded researchers using standard methodologies4
.Google built out a math research agent dubbed Aletheia that can conduct autonomous research or collaborate with humans on scientific research
1
. The new agent can also "admit failure to solve a problem," which improved efficiency for researchers by avoiding wasted time on unsolvable approaches1
. Google published papers resulting from the new technology spanning diverse fields from information and complexity theory to cryptography and mechanism design, demonstrating how AI is fundamentally shifting research1
. The AI model uses Google's search to avoid inaccuracies and wrongful citations when conducting research1
.Related Stories
One of the most practical new features allows researchers to interpret complex data and engineers to model physical systems through code
2
. With the updated model, users can turn a sketch into a 3D printing-ready fileโDeep Think analyzes the drawing, models the complex shape, and generates a file to create the physical object2
4
. This capability streamlines the prototyping process for engineers, allowing rapid iteration from basic concept to physical part4
.The Gemini 3 Deep Think upgrade is now available in the Gemini app for Google AI Ultra subscribers
1
2
. Google is also making it available via the Gemini API for enterprise users and a select group of researchers, with an early access program for those interested in integrating these deep reasoning capabilities into custom applications2
3
. This release is part of a broader push by leading AI developers to build more advanced tools that can handle everything from complex coding to scientific research, with Anthropic recently releasing a new version of its most powerful AI model for financial research and legal services1
.
Source: Bloomberg
Summarized by
Navi
[3]
04 Dec 2025โขTechnology

01 Aug 2025โขTechnology

12 Dec 2025โขTechnology

1
Technology

2
Policy and Regulation

3
Policy and Regulation
