Chinese Startup Moonshot AI's Open-Source Kimi K2 Thinking Model Outperforms GPT-5 and Claude 4.5 Sonnet

Reviewed byNidhi Govil

2 Sources

Share

Moonshot AI's new open-source Kimi K2 Thinking model has achieved breakthrough performance, surpassing OpenAI's GPT-5 and Anthropic's Claude 4.5 Sonnet in key reasoning and coding benchmarks. The trillion-parameter model is released under a Modified MIT License, marking a significant milestone in open-source AI competitiveness.

News article

Breakthrough Performance in Open-Source AI

Chinese AI startup Moonshot AI has released Kimi K2 Thinking, an open-source model that has achieved a significant milestone by outperforming leading proprietary AI systems including OpenAI's GPT-5 and Anthropic's Claude 4.5 Sonnet across multiple key benchmarks

1

. The model's release on November 6 marks what industry observers are calling an inflection point for the competitiveness of open AI systems against closed, proprietary alternatives

2

.

The Kimi K2 Thinking model achieved remarkable benchmark scores, including 44.9% on Humanity's Last Exam (HLE), 60.2% on BrowseComp for agentic web-search and reasoning, and 71.3% on SWE-Bench Verified for coding evaluations

1

. These results consistently exceed GPT-5's corresponding scores, with the BrowseComp performance particularly notable as K2 Thinking's 60.2% decisively leads GPT-5's 54.9% and Claude 4.5's 24.1%

1

.

Technical Architecture and Capabilities

Kimi K2 Thinking employs a Mixture-of-Experts (MoE) architecture built around one trillion parameters, with 32 billion parameters activating per inference

1

. The model supports context windows of up to 256,000 tokens and can execute 200-300 sequential tool calls without human intervention, making it particularly suited for complex, multi-step reasoning tasks

2

.

The model's defining capability lies in its explicit reasoning trace, outputting an auxiliary field called reasoning_content that reveals intermediate logic before each final response

1

. This transparency feature ensures visibility across multi-step workflows, addressing growing demands for explainable AI systems

2

.

Competitive Pricing and Accessibility

Despite its trillion-parameter scale, Kimi K2 Thinking maintains competitive pricing at $0.15 per million tokens for cache hits, $0.60 for cache misses, and $2.50 for output tokens

2

. These rates significantly undercut GPT-5's pricing of $1.25 for input and $10 for output, representing an order of magnitude difference in cost while delivering superior performance .

The model is available through multiple channels, including platform.moonshot.ai, kimi.com, and Hugging Face, with weights and code hosted openly

1

. Users can access the model through APIs for chat, reasoning, and multi-tool workflows, as well as try it directly through a ChatGPT-like web interface

1

.

Licensing and Commercial Use

Moonshot AI has released Kimi K2 Thinking under a Modified MIT License that grants full commercial and derivative rights with minimal restrictions

1

. The license includes one notable condition: deployments serving over 100 million monthly active users or generating over $20 million USD per month in revenue must prominently display 'Kimi K2' on the product's user interface

1

. This light-touch attribution requirement preserves the freedoms of standard MIT licensing while ensuring recognition for high-scale commercial applications.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Donโ€™t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

ยฉ 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo