Cursor Launches Composer 2 AI Coding Model to Challenge OpenAI and Anthropic Dominance

6 Sources

Share

AI coding startup Cursor has unveiled Composer 2, a specialized model trained exclusively on coding data that outperforms Claude Opus 4.6 on key benchmarks. Priced at $0.50 per million input tokens, it's 86% cheaper than its predecessor and designed for long-horizon agentic coding tasks. The company later clarified it's built on Moonshot AI's Kimi K2.5 base model after initial omission sparked scrutiny.

Cursor Unveils Composer 2 to Rival OpenAI and Anthropic

Cursor, the San Francisco-based AI coding startup valued at $29.3 billion, has launched Composer 2, a new AI model for software development designed to compete directly with industry giants OpenAI and Anthropic

1

. The AI coding model represents a strategic move by the company to maintain its competitive edge in an increasingly crowded market where AI coding assistants are becoming essential tools for developers

2

.

Source: Bloomberg

Source: Bloomberg

The programming-optimized model is now available inside Cursor's agentic AI coding environment and serves the company's more than 1 million daily users, including 50,000 businesses such as Stripe Inc. and Figma Inc.

1

.

Source: SiliconANGLE

Source: SiliconANGLE

Aman Sanger, Cursor's co-founder who leads the research team, emphasized that Composer 2 was trained solely on coding-related data, creating a smaller, more specialized model focused exclusively on software development workflows

1

.

Dramatic Cost Reduction and Performance Gains

Composer 2 delivers significant economic advantages over its predecessor, priced at $0.50 per million input tokens and $2.50 per million output tokens—representing an 86% cost reduction compared to Composer 1.5, which cost $3.50 per million input tokens and $17.50 per million output tokens

2

. The company also launched Composer 2 Fast, a higher-priced but faster variant at $1.5 per million input tokens and $7.5 per million output tokens, which is roughly 57% cheaper than Composer 1.5

2

.

The model supports prompts with up to 200,000 tokens and can generate code, fix bugs, and interact with command line interfaces

3

. Performance improvements are substantial: Composer 2 achieved 61.3 on CursorBench, 61.7 on Terminal-Bench 2.0, and 73.7 on SWE-bench Multilingual, compared to Composer 1.5's scores of 44.2, 47.9, and 65.9 respectively

2

.

Beating Anthropic but Trailing OpenAI on Key Coding Benchmarks

On Terminal-Bench 2.0, which measures how well an AI agent performs tasks in command line terminal-style interfaces, Composer 2 outperformed Anthropic's Claude Opus 4.6, which scored 58.0

2

. However, OpenAI's GPT-5.4 still leads at 75.1, positioning Composer 2 as a strong mid-tier competitor rather than the absolute benchmark leader

3

.

The model's strength lies in its optimization for long-horizon coding tasks—problems requiring hundreds of actions across multiple files, command executions, and iterative debugging

2

. Cursor's documentation describes the model as tuned specifically for tool use, file edits, and terminal operations within its platform, addressing one of the biggest challenges in coding AI: maintaining reliability across extended workflows rather than just isolated code generation

2

.

Kimi K2.5 Base Model Controversy and Transparency Questions

Days after launch, Cursor faced scrutiny when users discovered the company had not initially disclosed that Composer 2 was built on Moonshot AI's Kimi K2.5 base model

4

. Aman Sanger acknowledged the oversight in a post on X, stating, "We've evaluated a lot of base models on perplexity-based evals, and Kimi K2.5 proved to be the strongest"

4

.

Source: ET

Source: ET

Sanger explained that Composer 2 is built on top of the base model with further training, fine-tuning using reinforcement learning, and supporting systems for efficiency

4

. He admitted, "It was a miss to not mention the Kimi base in our blog from the start," and pledged to correct this in future releases

4

. The disclosure gap raises questions about transparency standards in AI development, particularly as companies compete to differentiate their offerings.

Technical Innovation Through Self-Summarization and Reinforcement Learning

Cursor employed self-summarization, a machine learning method that compresses information to fit within context window limits during training data processing . The company states that quality gains stem from its first continued pretraining run, which provided a stronger foundation for scaled reinforcement learning focused on agentic coding tasks

2

.

Composer 2 can access Cursor's agent tool stack, including semantic code search, file and folder search, file reads and edits, shell commands, browser control, and web access

2

. This tight integration positions the model as a Cursor-native release rather than a broadly distributed standalone offering, which may limit its addressable audience but enhances its effectiveness within the platform's ecosystem

2

.

Market Position and Funding Ambitions

Cursor has been in talks to raise a new funding round at approximately $50 billion valuation, though plans remain in early stages

1

5

. The company previously raised $900 million in June 2025 at a $9.9 billion valuation, led by Thrive Capital with participation from Andreessen Horowitz, Accel, and DST Global . The startup reached $100 million in annual recurring revenue in January 2025, demonstrating rapid commercial traction .

For developers evaluating whether to adopt Composer 2, the decision hinges less on raw benchmark superiority and more on whether they value a model optimized specifically for Cursor's product experience. The cost-to-performance tradeoff appears favorable compared to frontier models from OpenAI and Anthropic, particularly for teams already embedded in Cursor's development environment. As AI coding tools mature, the competitive landscape will likely reward specialized models that balance performance, cost, and workflow integration rather than pursuing universal benchmark dominance.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo