15 Sources
[1]
DeepSeek updates its R1 reasoning AI model, releases it on Hugging Face | TechCrunch
Chinese startup DeepSeek has released an updated version of its R1 reasoning AI model on the developer platform Hugging Face after announcing it in a WeChat message Wednesday morning. The updated R1, which is under a permissive MIT license, meaning it can be used commercially, is a "minor" upgrade, according to DeepSeek's WeChat announcement. The Hugging Face repository doesn't contain a description of the model -- only configuration files and weights, the internal components of a model that guide its behavior. Weighing in at 685 billion parameters, the updated R1 is quite hefty. ("Parameters" is synonymous with "weights.") Without modification, the model likely can't run on consumer-grade hardware. DeepSeek rose to prominence earlier this year following the release of R1, which gave models from OpenAI a run for their money. The startup has raised the ire of some regulators stateside, who argue that DeepSeek's technology poses a national security risk.
[2]
DeepSeek's distilled new R1 AI model can run on a single GPU | TechCrunch
DeepSeek's updated R1 reasoning AI model might be getting the bulk of the AI community's attention this week. But the Chinese AI lab also released a smaller, "distilled" version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks. The smaller updated R1, which was built using the Qwen3-8B model Alibaba launched in May as a foundation, performs better than Google's Gemini 2.5 Flash on AIME 2025, a collection of challenging math questions. DeepSeek-R1-0528-Qwen3-8B also nearly matches Microsoft's recently released Phi 4 reasoning plus model on another math skills test, HMMT. So-called distilled models like DeepSeek-R1-0528-Qwen3-8B are generally less capable than their full-sized counterparts. On the plus side, they're far less computationally demanding. According to the cloud platform NodeShift, Qwen3-8B requires a GPU with 40GB-80GB of RAM to run (e.g., an Nvidia H100). The full-sized new R1 needs around a dozen 80GB GPUs. DeepSeek trained DeepSeek-R1-0528-Qwen3-8B by taking text generated by the updated R1 and using it to fine-tune Qwen3-8B. On a dedicated webpage for the model on the AI dev platform Hugging Face, DeepSeek describes DeepSeek-R1-0528-Qwen3-8B as "for both academic research on reasoning models and industrial development focused on small-scale models." DeepSeek-R1-0528-Qwen3-8B is available under a permissive MIT license, meaning it can be used commercially without restriction. Several hosts, including LM Studio, already offer the model through an API.
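The distillation recipe described here (generating text with the large R1 and using it to fine-tune Qwen3-8B) reduces, at the data level, to collecting teacher outputs as supervised training pairs. A minimal sketch of that data-preparation step; the field names and toy teacher below are illustrative assumptions, not DeepSeek's actual pipeline:

```python
def build_distillation_pairs(prompts, teacher_generate):
    """Collect (prompt, completion) pairs for supervised fine-tuning a student model.

    `teacher_generate` stands in for sampling from the large teacher model
    (here, the updated R1); the student (e.g. Qwen3-8B) would then be
    fine-tuned on these pairs with an ordinary causal-LM loss.
    """
    pairs = []
    for prompt in prompts:
        completion = teacher_generate(prompt)  # the teacher's reasoning trace plus answer
        pairs.append({"prompt": prompt, "completion": completion})
    return pairs

# Toy stand-in for the teacher so the sketch runs without any model weights:
fake_teacher = lambda p: f"<think>working through: {p}</think> final answer"
dataset = build_distillation_pairs(["What is 2 + 2?"], fake_teacher)
```

In practice the heavy lifting is in the fine-tuning step itself, but the core idea of distillation is exactly this: the small model imitates text sampled from the large one.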
[3]
DeepSeek says a new R1 update is closing the gap with OpenAI o3 and Gemini 2.5 Pro.
The Chinese AI model that shook up the industry as a more cost-efficient alternative to the ones from OpenAI, Google, and Meta now has a new update, dubbed DeepSeek-R1-0528. DeepSeek says its latest model has a reduced "hallucination" rate, and that it "has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training...overall performance is now approaching that of leading models, such as O3 and Gemini 2.5 Pro."
[4]
DeepSeek Says Upgraded Model Reasons Better, Hallucinates Less
The Chinese startup DeepSeek said Thursday that its upgraded artificial-intelligence model can perform mathematics, programming, and general logic better than the previous version, while hallucinating less. The upgrade to its R1 model -- which stunned the AI world in January by rivaling the systems of much-larger US developers despite being built at what the Chinese startup said was a fraction of the cost -- features a greater depth of reasoning, DeepSeek said in a post on the AI model platform Hugging Face.
[5]
China's DeepSeek releases an update to its R1 reasoning model
SHANGHAI, May 29 - Chinese artificial intelligence startup DeepSeek released an update to its R1 reasoning model in the early hours of Thursday, stepping up competition with U.S. rivals such as OpenAI. DeepSeek launched R1-0528 on developer platform Hugging Face, but has yet to make an official public announcement. It did not publish a description of the model or comparisons. But the LiveCodeBench leaderboard, a benchmark developed by researchers from UC Berkeley, MIT, and Cornell, ranked DeepSeek's updated R1 reasoning model just slightly behind OpenAI's o4 mini and o3 reasoning models on code generation and ahead of xAI's Grok 3 mini and Alibaba's Qwen 3. Bloomberg earlier reported the update on Wednesday. It said that a DeepSeek representative had told a WeChat group that it had completed what it described as a "minor trial upgrade" and that users could start testing it. DeepSeek earlier this year upended beliefs that U.S. export controls were holding back China's AI advancements after the startup released AI models that were on a par with or better than industry-leading models in the United States at a fraction of the cost. The launch of R1 in January sent tech shares outside China plummeting and challenged the view that scaling AI requires vast computing power and investment. Since R1's release, Chinese tech giants like Alibaba (9988.HK) and Tencent (0700.HK) have released models claiming to surpass DeepSeek's. Google's (GOOGL.O) Gemini has introduced discounted tiers of access while OpenAI cut prices and released an o3 Mini model that relies on less computing power. The company is still widely expected to release R2, a successor to R1. Reuters reported in March, citing sources, that R2's release was initially planned for May. DeepSeek also released an upgrade to its V3 large language model in March.
Reporting by Brenda Goh and Eduardo Baptista; Editing by Michael Perry
[6]
China's DeepSeek quietly releases upgraded R1 AI model, ramping up competition with OpenAI
Chinese startup DeepSeek, which caused shockwaves across markets this year, quietly released an upgraded version of its artificial intelligence reasoning model. The company did not make an official announcement, but the upgrade of DeepSeek R1 was released on AI model repository Hugging Face. DeepSeek rose to prominence this year after its free, open-source R1 reasoning model outperformed offerings from rivals including Meta and OpenAI. Its low cost and short development time shocked global markets, sparking concerns that U.S. tech giants were overspending on infrastructure and wiping billions of dollars of value off major U.S. tech stocks like AI stalwart Nvidia. These companies have since broadly recovered. Just as was the case with DeepSeek R1's debut, the upgraded model was also released with little fanfare. It is a reasoning model, which means the AI can execute more complicated tasks through a step-by-step logical thought process. The upgraded DeepSeek R1 model is just behind OpenAI's o4-mini and o3 reasoning models on LiveCodeBench, a site that benchmarks models against different metrics. DeepSeek has become the poster child of how Chinese artificial intelligence is still developing despite U.S. attempts to restrict the country's access to chips and other technology. This month, Chinese technology giants Baidu and Tencent revealed how they were making their AI models more efficient to deal with U.S. semiconductor export curbs.
[7]
DeepSeek R1-0528 arrives in powerful open source challenge to OpenAI o3 and Google Gemini 2.5 Pro
After rocking the global AI and business community early this year with the January 20 initial release of its hit open source reasoning AI model R1, the Chinese startup DeepSeek -- a spinoff of the Hong Kong quantitative analysis firm High-Flyer Capital Management, previously well known only locally -- has released DeepSeek-R1-0528, a significant update that brings DeepSeek's free and open model near parity in reasoning capabilities with proprietary paid models such as OpenAI's o3 and Google Gemini 2.5 Pro. This update is designed to deliver stronger performance on complex reasoning tasks in math, science, business and programming, along with enhanced features for developers and researchers. Like its predecessor, DeepSeek-R1-0528 is available under the permissive and open MIT License, supporting commercial use and allowing developers to customize the model to their needs. Open-source model weights are available via the AI code sharing community Hugging Face, and detailed documentation is provided for those deploying locally or integrating via the DeepSeek API. Existing users of the DeepSeek API will automatically have their model inferences updated to R1-0528 at no additional cost. For those looking to run the model locally, DeepSeek has published detailed instructions on its GitHub repository. The company also encourages the community to provide feedback and questions through its service email. Individual users can try the model for free through DeepSeek's website, though they will need to provide a phone number or Google Account access to sign in. Enhanced reasoning and benchmark performance At the core of the update are significant improvements in the model's ability to handle challenging reasoning tasks.
DeepSeek explains in its new model card on Hugging Face that these enhancements stem from leveraging increased computational resources and applying algorithmic optimizations in post-training. This approach has resulted in notable improvements across various benchmarks. In the AIME 2025 test, for instance, DeepSeek-R1-0528's accuracy jumped from 70% to 87.5%, indicating deeper reasoning processes that now average 23,000 tokens per question compared to 12,000 in the previous version. Coding performance also saw a boost, with accuracy on the LiveCodeBench dataset rising from 63.5% to 73.3%. On the demanding "Humanity's Last Exam," performance more than doubled, reaching 17.7% from 8.5%. These advances put DeepSeek-R1-0528 closer to the performance of established models like OpenAI's o3 and Gemini 2.5 Pro, according to internal evaluations -- both of which are rate-limited or require paid subscriptions to access. UX upgrades and new features Beyond performance improvements, DeepSeek-R1-0528 introduces several new features aimed at enhancing the user experience. The update adds support for JSON output and function calling, features that should make it easier for developers to integrate the model's capabilities into their applications and workflows. Front-end capabilities have also been refined, and DeepSeek says these changes will create a smoother, more efficient interaction for users. Additionally, the model's hallucination rate has been reduced, contributing to more reliable and consistent output. One notable update is the introduction of system prompts. Unlike the previous version, which required a special token at the start of the output to activate "thinking" mode, this update removes that need, streamlining deployment for developers.
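The JSON-output and function-calling features mentioned above are typically exercised through an OpenAI-style chat-completions request. A minimal sketch of what such a request body might look like, assuming an OpenAI-compatible endpoint; the model name, tool schema, and field values here are illustrative assumptions, not taken from DeepSeek's documentation:

```python
import json

# Hypothetical request body in the OpenAI-compatible chat format.
payload = {
    "model": "deepseek-reasoner",
    "messages": [{"role": "user", "content": "What's the weather in Hangzhou?"}],
    # Function calling: declare a tool the model may ask the caller to invoke.
    "tools": [{
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Look up current weather for a city",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }],
    # JSON mode: ask the model to emit a syntactically valid JSON object.
    "response_format": {"type": "json_object"},
}
body = json.dumps(payload)  # this string would be POSTed to the chat endpoint
```

The practical benefit for developers is that the model's reply can be parsed mechanically (a JSON object, or a structured tool-call request) instead of being scraped out of free-form prose.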
Smaller variants for those with more limited compute budgets Alongside this release, DeepSeek has distilled its chain-of-thought reasoning into a smaller variant, DeepSeek-R1-0528-Qwen3-8B, which should help those enterprise decision-makers and developers who don't have the hardware necessary to run the full-sized model. This distilled version reportedly achieves state-of-the-art performance among open-source models on tasks such as AIME 2024, outperforming Qwen3-8B by 10% and matching Qwen3-235B-thinking. According to Modal, running an 8-billion-parameter large language model (LLM) in half-precision (FP16) requires approximately 16 GB of GPU memory, equating to about 2 GB per billion parameters. Therefore, a single high-end GPU with at least 16 GB of VRAM, such as the NVIDIA RTX 3090 or 4090, is sufficient to run an 8B LLM in FP16 precision. For further quantized models, GPUs with 8-12 GB of VRAM, like the RTX 3060, can be used. DeepSeek believes this distilled model will prove useful for academic research and industrial applications requiring smaller-scale models. Initial AI developer and influencer reactions The update has already drawn attention and praise from developers and enthusiasts on social media. Haider aka "@slow_developer" shared on X that DeepSeek-R1-0528 "is just incredible at coding," describing how it generated clean code and working tests for a word scoring system challenge, both of which ran perfectly on the first try. According to him, only o3 had previously managed to match that performance. Meanwhile, Lisan al Gaib posted that "DeepSeek is aiming for the king: o3 and Gemini 2.5 Pro," reflecting the consensus that the new update brings DeepSeek's model closer to these top performers. Another AI news and rumor influencer, Chubby, commented that "DeepSeek was cooking!" and highlighted how the new version is nearly on par with o3 and Gemini 2.5 Pro.
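The memory rule of thumb cited above (roughly 2 GB of VRAM per billion parameters at FP16) is simple arithmetic: bytes per parameter times parameter count. A back-of-the-envelope sketch; the function name is mine, and the result is a floor for holding the weights only, ignoring KV cache and activation overhead:

```python
def estimate_weight_vram_gb(params_billions: float, bytes_per_param: float) -> float:
    """Rough GB of GPU memory needed just to hold the model weights.

    Ignores KV cache, activations, and framework overhead, so treat the
    result as a lower bound rather than a full requirement.
    """
    return params_billions * bytes_per_param

fp16_gb = estimate_weight_vram_gb(8, 2.0)  # FP16: 2 bytes/param -> 16.0 GB for an 8B model
int4_gb = estimate_weight_vram_gb(8, 0.5)  # 4-bit quantization: 0.5 bytes/param -> 4.0 GB
```

The same arithmetic explains why 4-bit quantized builds of an 8B model fit on 8-12 GB consumer cards like the RTX 3060.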
Chubby even speculated that the latest R1 update might indicate that DeepSeek is preparing to release its long-awaited and presumed "R2" frontier model soon as well. Looking Ahead The release of DeepSeek-R1-0528 underscores DeepSeek's commitment to delivering high-performing, open-source models that prioritize reasoning and usability. By combining measurable benchmark gains with practical features and a permissive open-source license, DeepSeek-R1-0528 is positioned as a valuable tool for developers, researchers, and enthusiasts looking to harness the latest in language model capabilities.
[8]
New DeepSeek-R1 Is as Good as OpenAI o3 and Gemini 2.5 Pro | AIM
The new DeepSeek-R1-0528 has "significantly improved its depth of reasoning and inference capabilities." Chinese AI model maker DeepSeek announced a new update to its R1 reasoning model on Wednesday. The updated model, DeepSeek-R1-0528, is available on Hugging Face. "In the latest update, DeepSeek R1 has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimisation mechanisms during post-training," said DeepSeek. The company also shared the model's benchmark results, which showed that it achieved performance parity with OpenAI's o3 and Google's Gemini 2.5 Pro models on multiple evaluations. In the AIME 2025 test, DeepSeek-R1-0528 scored 87.5%, close to OpenAI's o3 (88.9%) and ahead of Gemini 2.5 Pro (83.0%). The model also achieved scores on par with leading AI models on other coding, mathematics, and reasoning evaluations, as seen on Artificial Analysis. It scored 77% on LiveCodeBench (a coding benchmark), matching Gemini 2.5 Pro (77%) and nearly matching OpenAI's o3 (78%) in coding ability. On the reasoning and general knowledge benchmark MMLU-Pro, DeepSeek-R1 achieved 85%, comparable to Gemini 2.5 Pro (84%) and OpenAI's o3 (85%). Several users have already downloaded and deployed the model locally, as per their social media posts. Ivan Fioravanti, CTO of CoreView, said on X that he could run DeepSeek-R1-0528-4bit at around 21 tokens per second on an Apple M3 Ultra chip-based device. The DeepSeek-R1 reasoning model, released in January, created quite a storm across the AI ecosystem. At its launch, the model surpassed several competing ones in benchmarks. DeepSeek prioritises using efficient techniques in the model's architecture to improve performance rather than relying on high computing power. One of DeepSeek's previous models, V3, used 2048 NVIDIA H800 GPUs to achieve performance better than most open-source models.
Andrej Karpathy, former OpenAI researcher, said DeepSeek V3's level of capability is 'supposed to require clusters of closer to 16,000 GPUs'. This caused numerous entities to doubt the demand for AI-related hardware, resulting in a market cap loss of over $500 billion for NVIDIA in just one day. Numerous startups and products use the open-source DeepSeek model for deployment, and its capabilities are extensively recognised across various sectors in China. Recently, it was reported to be used for research and development for the country's 'most advanced warplanes'. German automotive leader BMW has also revealed plans to incorporate DeepSeek into its vehicles in China. Last month, The New York Times revealed that courtroom officials are utilising DeepSeek to draft legal documents in minutes. Additionally, doctors and agencies are employing the model to locate missing persons. The report further noted that numerous companies are "encouraging" employees to adopt DeepSeek for design and customer service tasks.
[9]
DeepSeek releases upgraded AI model, claims parity with ChatGPT, Gemini
The R1-0528 model features enhanced reasoning capabilities, with DeepSeek asserting it's closing the gap with OpenAI's and Google's latest models. DeepSeek, a China-based artificial intelligence company, has announced an upgrade to its AI chatbot, saying it can now offer enhanced overall logic, mathematics and programming with a reduced hallucination rate. According to DeepSeek, the upgraded model -- DeepSeek-R1-0528 -- has "significantly improved its depth of reasoning and inference capabilities." The startup said the model's overall performance is now "approaching that of leading models, such as O3 and Gemini 2.5 Pro." DeepSeek's debut of its R1 chatbot in January sent shockwaves through the AI industry and further established China as an AI force. The company's first AI model had a training cost of $6 million and performance similar to leading AI models trained with significantly more capital. According to data from Business of Apps, DeepSeek has been downloaded 75 million times since its launch and had 38 million monthly active users (MAU) as of April. In a recent antitrust lawsuit, Google estimated that Gemini reached 350 million active users in March, while OpenAI's ChatGPT claimed 600 million active users in the same month. The United States government is planning to restrict the sale of advanced chip design software to China. According to a Bloomberg report, the move seeks to limit China's ability to advance its domestic semiconductor manufacturing capabilities. Semiconductors are critical for a wide range of technologies, including AI, where they serve as the hardware backbone for training and running complex models. New Chinese AI models, such as Tencent's T1 and Alibaba's Qwen3, have also emerged in the first few months of 2025, spurring the AI race along.
[10]
A new DeepSeek R1 AI model just hit the Hugging Face platform
DeepSeek, a Chinese startup, released an updated version of its R1 reasoning AI model on Hugging Face Wednesday, following an announcement on WeChat. The original R1 model gained prominence earlier this year and rivaled models from OpenAI. The updated R1 model is a "minor" upgrade, according to DeepSeek's WeChat announcement. The model is available under a permissive MIT license, allowing for commercial use. The Hugging Face repository for the updated R1 model, identified as DeepSeek-R1-0528, consists of configuration files and weights, but lacks a description of the model itself. The updated R1 model contains 685 billion parameters. Due to its size, the model likely requires specialized hardware beyond consumer-grade capabilities. DeepSeek's technology has also garnered attention from regulators in the United States, with some arguing that it poses a national security risk.
[11]
DeepSeek Unveils Update to R1 Model as AI Race Heats Up
DeepSeek said it has upgraded the R1 artificial-intelligence model that helped propel the Chinese startup to global prominence earlier this year. DeepSeek completed what it described as a "minor trial upgrade" and told users they can start testing it, according to a company representative's post in an official WeChat group on Wednesday. The Hangzhou-based startup stunned the global tech industry in January when it unveiled R1, an AI model that outperformed Western players on several standardized metrics, purportedly at a cost of just several million dollars. That triggered a rout in global tech stocks as investors questioned whether leading firms would still need to spend significant amounts to build AI services. The debut of R1 also set off a race to launch additional AI models in China. Founder Liang Wenfeng became a symbol of the country's ability to compete with the best of Silicon Valley. © 2025 Bloomberg LP
[12]
DeepSeek unveils update to R1 model
DeepSeek has announced a minor upgrade to its R1 AI model, which previously stunned the tech world by outperforming Western competitors at a fraction of the cost. The news arrives shortly before the latest financial report from Nvidia, whose shares were impacted by the initial R1 release. The upgrade follows founder Liang Wenfeng's rise to prominence and recognition from President Xi Jinping. DeepSeek said it has upgraded the R1 artificial-intelligence model that helped propel the Chinese startup to global prominence earlier this year. DeepSeek completed what it described as a "minor trial upgrade" and told users they can start testing it, according to a company representative's post in an official WeChat group on Wednesday. The company didn't provide details about the upgrade and didn't respond to an email seeking further comment. The Hangzhou-based startup stunned the global tech industry in January when it unveiled the original R1, an AI model that outperformed Western players on several standardised metrics, purportedly at a cost of just several million dollars. That triggered a rout in global tech stocks as investors questioned whether leading firms would still need to spend significant amounts to build AI services. The debut of R1 turned founder Liang Wenfeng into a tech celebrity and a symbol of the country's ability to compete with the best of Silicon Valley. It also set off a race to launch additional AI models in China. In February, President Xi Jinping invited Liang to a high-profile gathering with some of the country's most prominent entrepreneurs. The young founder was seated among the likes of Alibaba Group Holding Ltd. co-founder Jack Ma and Tencent Holdings Ltd.'s Pony Ma. DeepSeek's upgrade was announced just hours before the latest financial report from Nvidia Corp., the leading maker of AI chips whose shares were pummeled in the January rout.
[13]
China's DeepSeek releases an update to its R1 reasoning model
Chinese artificial intelligence startup DeepSeek released an update to its R1 reasoning model in the early hours of Thursday, stepping up competition with U.S. rivals such as OpenAI. DeepSeek launched R1-0528 on developer platform Hugging Face, but has yet to make an official public announcement. It did not publish a description of the model or comparisons. But the LiveCodeBench leaderboard, a benchmark developed by researchers from UC Berkeley, MIT, and Cornell, ranked DeepSeek's updated R1 reasoning model just slightly behind OpenAI's o4 mini and o3 reasoning models on code generation and ahead of xAI's Grok 3 mini and Alibaba's Qwen 3. Bloomberg earlier reported the update on Wednesday. It said that a DeepSeek representative had told a WeChat group that it had completed what it described as a "minor trial upgrade" and that users could start testing it. DeepSeek earlier this year upended beliefs that U.S. export controls were holding back China's AI advancements after the startup released AI models that were on a par with or better than industry-leading models in the United States at a fraction of the cost. The launch of R1 in January sent tech shares outside China plummeting and challenged the view that scaling AI requires vast computing power and investment. Since R1's release, Chinese tech giants like Alibaba and Tencent have released models claiming to surpass DeepSeek's. Google's Gemini has introduced discounted tiers of access while OpenAI cut prices and released an o3 Mini model that relies on less computing power. The company is still widely expected to release R2, a successor to R1. Reuters reported in March, citing sources, that R2's release was initially planned for May.
DeepSeek also released an upgrade to its V3 large language model in March.
[14]
DeepSeek Upgrades AI Reasoning Model to Rival OpenAI and Google | PYMNTS.com
The model also hallucinates less, has enhanced support for function calling and offers a better experience in vibe coding, where developers use natural language prompts in an AI chatbot to write code, per the post. Read also: DeepSeek Debuts Upgrade to AI Model That Improves Reasoning and Coding As an open-source model, R1-0528 is free to use. It offers an MIT license, which lets users download, run and modify the model. Cloud providers Amazon Web Services (AWS) and Microsoft Azure offer DeepSeek's R1 model to their clients as part of their respective AI platforms. But they strip out any connection to Chinese servers, so data stays in the client's chosen servers, AWS has told PYMNTS. Companies with their own developers can also use DeepSeek's R1-0528 model and customize it for their own use cases. As long as they don't use DeepSeek's API, the data stays in their designated servers. DeepSeek and Meta offer some of the most popular and powerful open-source AI models as a counter to proprietary models offered by OpenAI, Google, Microsoft, Anthropic and others. While open-source models are free, they are not always cheaper to use depending on how much customization is needed. A company that doesn't have the staff to customize would still need to hire an outside firm to do so. Also, running the model incurs costs per token in the cloud, unless users can host the model on their own servers.
[15]
China's DeepSeek Upgrades R1 Reasoning Model with New Features
DeepSeek's R1 Update Reinforces China's AI Push Amid Export Controls Chinese AI startup DeepSeek has quietly slipped in an update to its R1 reasoning model, a competitive move against US AI developers like OpenAI. The new update, R1-0528, appeared on the AI model platform Hugging Face without an official announcement or any description of its functionality. However, benchmark results show that DeepSeek's model ranks a close second to OpenAI's finest, the o4 mini and o3 reasoning models, in code generation performance.
Chinese AI startup DeepSeek releases an update to its R1 reasoning model, claiming improved performance and reduced hallucination. The company also introduces a smaller, distilled version that can run on a single GPU.
Chinese AI startup DeepSeek has released an updated version of its R1 reasoning AI model, marking a significant advancement in the field of artificial intelligence. The company announced the update, dubbed DeepSeek-R1-0528, on the developer platform Hugging Face, intensifying competition with U.S. rivals such as OpenAI and Google [1][5].
According to DeepSeek, the latest iteration of R1 boasts enhanced reasoning capabilities and a reduced rate of hallucination. The company claims that the model "has significantly improved its depth of reasoning and inference capabilities by leveraging increased computational resources and introducing algorithmic optimization mechanisms during post-training" [3]. This upgrade has reportedly brought the model's overall performance closer to that of industry leaders like OpenAI's o3 and Google's Gemini 2.5 Pro [3][4].
The updated R1 model is substantial in size, weighing in at 685 billion parameters. This heft suggests that without modification, the model is unlikely to run on consumer-grade hardware [1]. DeepSeek has released the model under a permissive MIT license, allowing for commercial use without restrictions [1][2].
In addition to the full-sized update, DeepSeek has introduced a smaller, "distilled" version of the new R1, called DeepSeek-R1-0528-Qwen3-8B. This version was built using Alibaba's Qwen3-8B model as a foundation and is designed to run on a single GPU, making it more accessible for both academic research and industrial development focused on small-scale models [2].
The distilled version of R1 has shown impressive results on certain benchmarks. DeepSeek claims that it outperforms Google's Gemini 2.5 Flash on AIME 2025, a collection of challenging math questions, and nearly matches Microsoft's Phi 4 reasoning plus model on another math skills test, HMMT [2].
DeepSeek's R1 model initially stunned the AI world in January by rivaling systems from much larger U.S. developers, despite being built at what the company claimed was a fraction of the cost [4]. This development has challenged the notion that scaling AI requires vast computing power and investment [5].
The release of the updated R1 and its distilled version has further intensified competition in the AI industry. It has prompted responses from tech giants, with Google's Gemini introducing discounted tiers of access and OpenAI cutting prices and releasing a mini version of its o3 model [5].
While DeepSeek continues to refine its R1 model, the industry is anticipating the release of R2, a successor to R1. R2's release was initially planned for May, but its exact date remains uncertain [5]. As DeepSeek and other Chinese AI companies continue to advance their technologies, the global AI landscape is becoming increasingly competitive, with potential implications for international tech markets and regulatory considerations.
Summarized by
Navi