2 Sources
[1]
DeepSeek permanently reduces the price of its flagship V4 model by 75 percent - Engadget
The lower prices could be aimed at undercutting the competition. DeepSeek is leaning hard into being the "cost-effective" choice for AI agents. According to its website, the Chinese startup is dropping the price for its latest flagship model, DeepSeek V4 Pro, to a fourth of its original price. This latest price update makes permanent the 75 percent discount promotion that was previously supposed to end on May 31, 2026. As seen on the website's pricing page, the DeepSeek V4 Pro prices now range from $0.003625 to $0.87 per one million tokens, compared to the previous range between $0.0145 to $3.48 for every million tokens. The company's decision to permanently reduce the price comes a month after it released its V4 models, Pro and Flash, which it claimed would welcome the "era of cost-effective 1M context length." DeepSeek's deep discounts should prove to be a major cost savings for enterprise accounts or power users who go through millions of tokens in a day. The major price drop also presents a more affordable alternative to other popular AI models, like OpenAI's GPT-5 or the recently released Gemini 3.5 Flash from Google. DeepSeek's price undercutting might provoke competitors, like Anthropic, who previously accused the Chinese company of "distillation attacks" that improperly learn from Claude's more capable AI models.
[2]
DeepSeek made its 75% discount permanent. The AI price war just escalated.
DeepSeek permanently cut V4 Pro prices by 75%, to $0.87 per million output tokens. It undercuts GPT-5, Gemini, and Claude. DeepSeek has made permanent the 75% price discount on its flagship V4 Pro model. The promotion was originally scheduled to expire on 31 May. The Chinese AI startup's pricing now ranges from $0.003625 to $0.87 per million tokens, down from $0.0145 to $3.48. The price points are striking in context. OpenAI's GPT-5 charges $2.50 per million input tokens and $10 per million output tokens. Anthropic's Claude Opus 4.7 is priced at $5 input and $25 output. Google's Gemini 3.5 Flash, its cost-optimised model, charges $0.15 input and $0.60 output per million tokens. DeepSeek V4 Pro's new permanent pricing sits below all of them. The gap is widest against the frontier reasoning models that enterprise customers rely on for demanding workloads. The decision to lock in the discount one month after launching the V4 models suggests DeepSeek is prioritising market share over per-unit revenue. The company described V4 as welcoming the "era of cost-effective 1M context length." It is positioning its models as the default for applications that process large documents, codebases, or conversational histories where token costs compound fast. For enterprise accounts consuming millions of tokens daily, the savings are material. Salesforce projects $300 million in Anthropic token spending this year. At DeepSeek's new pricing, an equivalent volume would cost a fraction of that figure. The question for enterprise buyers is whether DeepSeek's model quality, reliability, and compliance posture justify the switch. The price advantage may be offset by the geopolitical and technical risks of routing sensitive workloads through a Chinese AI provider. That calculus varies by industry and by the sensitivity of the data involved. The competitive dynamics are complicated by Anthropic's public accusation that DeepSeek has engaged in "distillation attacks." The allegation is that DeepSeek improperly trained on Claude's responses to improve its own models. DeepSeek has not publicly addressed the accusation in detail. If substantiated, it would mean that some of DeepSeek's capability advantage was built on Anthropic's research investment. The price differential would then reflect intellectual property arbitrage rather than engineering efficiency. The accusation remains unresolved. Anthropic's annualised revenue surged from $9 billion to $30 billion between the end of 2025 and early April 2026. That growth was driven largely by enterprise adoption of Claude Code. DeepSeek's pricing pressure threatens the revenue-per-token economics that support Anthropic's valuation trajectory. If enterprise customers begin routing lower-complexity tasks to DeepSeek while reserving Claude for high-stakes reasoning, Anthropic's token volume could hold while revenue per token declines. The broader AI pricing landscape has been moving toward commoditisation throughout 2026. Google has repeatedly cut Gemini prices to compete with open-weight models. OpenAI's pivot toward consumer platform features, including personal finance tools and advertising, reflects a recognition that API token revenue alone may not sustain its $852 billion valuation. DeepSeek's permanent price cut accelerates a trend that was already compressing margins across the industry. The era of high-margin AI tokens may be ending faster than anyone expected. DeepSeek V4 Pro supports a one-million-token context window at the new pricing. That makes it competitive for document analysis, legal review, and codebase comprehension. These are the long-context applications where input cost is the binding constraint on adoption. The combination of frontier-adjacent capability and radically lower pricing creates a genuine dilemma for CTOs. The cheapest option is also the one with the most geopolitical complexity. It has the least transparency about training data provenance and an unresolved IP accusation from one of its most capable competitors. DeepSeek's strategy appears to be that price will win. Enough volume will flow to the cheapest capable model regardless of origin. The geopolitical concerns that constrain adoption in government and regulated industries will not prevent adoption in the broader market. Whether that bet is correct depends on whether Western AI companies can close the price gap before DeepSeek closes the capability gap. The alternative is that the market bifurcates into a Western tier and a Chinese tier with fundamentally different economics. DeepSeek just made sure the gap between them got wider.
Share
Copy Link
DeepSeek has permanently slashed the price of its flagship V4 Pro model by 75 percent, making it one of the most affordable AI models on the market. The Chinese startup now charges $0.003625 to $0.87 per million tokens, significantly undercutting competitors like OpenAI's GPT-5, Google's Gemini 3.5 Flash, and Anthropic's Claude. The aggressive pricing strategy could reshape enterprise AI economics but raises questions about geopolitical risks and unresolved intellectual property concerns.
DeepSeek has made permanent its 75 percent discount on the flagship DeepSeek V4 Pro model, a move that dramatically escalates the AI price war among major providers
1
. The Chinese AI startup's price reduction was originally scheduled to expire on May 31, 2026, but the company has now locked in the discount indefinitely2
. According to its website, DeepSeek V4 Pro now costs between $0.003625 to $0.87 per one million tokens, down from the previous range of $0.0145 to $3.481
. This aggressive pricing positions DeepSeek as a cost-effective alternative to established language model providers, potentially transforming how enterprise users approach AI spending.
Source: Engadget
The price reduction puts DeepSeek well below competing models from OpenAI, Google, and Anthropic. OpenAI's GPT-5 charges $2.50 per million input tokens and $10 per million output tokens, while Anthropic's Claude Opus 4.7 is priced at $5 input and $25 output
2
. Even Google's Gemini 3.5 Flash, positioned as a cost-optimized model, charges $0.15 input and $0.60 output per million tokens—still higher than DeepSeek's new pricing2
. The gap is widest against frontier reasoning models that enterprise customers rely on for demanding workloads. For organizations consuming millions of tokens daily, the cost savings are substantial. Salesforce projects $300 million in Anthropic token spending this year; at DeepSeek's new pricing, an equivalent volume would cost a fraction of that figure2
.The decision to make the discount permanent just one month after launching the V4 models suggests DeepSeek is prioritizing its ability to win market share over per-unit revenue
2
. The company described V4 as welcoming the "era of cost-effective 1M context length," positioning its models as the default for applications that process large documents, codebases, or conversational histories where token costs compound rapidly1
. DeepSeek V4 Pro supports a one-million-token context window at the new pricing, making it competitive for document analysis, legal review, and codebase comprehension—applications where input cost is the binding constraint on adoption2
.Related Stories
While the pricing is attractive, the question for enterprise users is whether DeepSeek's model quality, reliability, and compliance posture justify the switch
2
. The price advantage may be offset by geopolitical concerns of routing sensitive workloads through a Chinese AI provider, a calculus that varies by industry and data sensitivity2
. Adding complexity, Anthropic has publicly accused DeepSeek of engaging in "distillation attacks," alleging that DeepSeek improperly trained on Claude's responses to improve its own models—a form of intellectual property infringement1
. If substantiated, it would mean that some of DeepSeek's capability advantage was built on Anthropic's research investment, with the price differential reflecting intellectual property arbitrage rather than engineering efficiency2
. DeepSeek has not publicly addressed the accusation in detail, and the allegation remains unresolved2
.DeepSeek's permanent price cut accelerates a trend toward commoditization that has been compressing margins across the AI industry throughout 2026
2
. Google has repeatedly cut Gemini prices to compete with open-weight models, while OpenAI's pivot toward consumer platform features, including personal finance tools and advertising, reflects recognition that API token revenue alone may not sustain its $852 billion valuation2
. Anthropic's annualized revenue surged from $9 billion to $30 billion between the end of 2025 and early April 2026, driven largely by enterprise adoption of Claude Code2
. DeepSeek's pricing pressure threatens the revenue-per-token economics that support Anthropic's valuation trajectory. If enterprise customers begin routing lower-complexity tasks to DeepSeek while reserving Claude for high-stakes reasoning, Anthropic's token volume could hold while revenue per token declines2
. Whether DeepSeek's bet on price succeeds depends on whether Western AI companies can close the price gap before DeepSeek closes the capability gap, or whether the market bifurcates into Western and Chinese tiers with fundamentally different economics2
.Summarized by
Navi
[1]
1
Technology

2
Technology

3
Science and Research
