Baseten raises $1.5 billion at $13 billion valuation as AI inference demand explodes

3 Sources

Share

Baseten is finalizing a $1.5 billion funding round that values the AI inference platform at up to $13 billion, more than doubling its $5 billion January valuation. The dual-tiered structure allows some investors to buy in at $11 billion while others pay $13 billion, reflecting surging demand for inference as companies shift workloads to open-source AI models that cost a fraction of proprietary alternatives.

Baseten Secures $1.5 Billion in Dual-Tiered Funding Round

Baseten is finalizing a $1.5 billion funding round that values the AI inference platform at up to $13 billion, marking one of the most dramatic valuations in recent AI infrastructure history

1

. The round features an unusual dual-tiered structure where some investors are buying shares at an $11 billion valuation while others are paying $13 billion, according to reports

2

3

. This tactic, increasingly common among AI startups, allows companies to lift headline numbers while accommodating different investor risk appetites. The financing is co-led by Altimeter Capital, Conviction, Spark Capital, Sands Capital, and Wellington Management

1

.

Source: PYMNTS

Source: PYMNTS

Explosive Growth Driven by Surging Demand for Inference

The Baseten funding represents a remarkable acceleration in valuation. The San Francisco-based company was valued at $5 billion as recently as January when it raised a $300 million Series E backed by IVP, CapitalG, and Nvidia

1

3

. The new deal more than doubles that valuation in roughly five months. Just nine months earlier, in September 2025, Baseten was worth about $2.15 billion

3

. This rapid re-rating reflects explosive revenue growth, with annualized revenue reaching around $600 million at the end of the first quarter, compared to $200 million at the beginning of the quarter

3

.

Open-Source AI Models Reshape the AI Stack Economics

Baseten sells what founder and CEO Tuhin Srivastava describes as the unglamorous but essential part of the AI stack

1

. Founded in 2019, the company provides the software and computing capacity businesses need to run AI model inference—the step where a trained model actually answers a query. Baseten rents capacity from around 20 cloud providers and layers its own inference software on top, allowing customers to deploy and fine-tuning models on their own data

1

3

. Much of this work runs on cheaper open-source AI models rather than frontier systems from OpenAI or Anthropic. "Open-source models are getting very, very good," Srivastava told the Wall Street Journal, noting that customers increasingly mix open and proprietary models depending on task complexity

1

. Some customers have cut costs sharply by shifting inference workloads to open-source options, with one reportedly running tasks at about 30 percent of the cost of proprietary alternatives

1

.

Source: SiliconANGLE

Source: SiliconANGLE

Technical Infrastructure Powers Cost-Effective AI Solutions

The Baseten AI inference platform automates the complex workflow of setting up cloud-based inference clusters, eliminating the need for developers to manually provision graphics cards, configure them, and install numerous software tools

2

. The platform is powered by three core inference engines optimized for different AI architectures. BIS-LLM handles LLMs with mixture of experts architecture by optimizing KV cache and automatically provisioning hardware as token usage increases. Engine-Builder-LLM targets dense LLMs using lookahead decoding to generate multiple tokens simultaneously. BEI serves simpler models including embedding, classification, and search models

2

. A software module called MCM spreads workloads across multiple cloud providers, rerouting prompts during outages and addressing graphics card shortages. Customers can deploy custom algorithms using Truss, which automates packaging LLMs into Baseten-compatible formats

2

.

The Picks-and-Shovels Bet Amid an AI Inference Price War

The raise positions Baseten at the center of an AI infrastructure gold rush where the layer making models run efficiently has become among the most sought-after in the industry

1

. Customers including Cursor, Mercor, and OpenEvidence use Baseten to serve models

1

. Yet the funding also arrives amid intensifying price competition. Many widely-used open-source models now originate from China, including DeepSeek and Moonshot AI, while even Nvidia has released an open family called Nemotron

1

. With OpenAI reportedly weighing steep price cuts, that pressure threatens frontier labs while theoretically benefiting companies helping businesses run cheaper alternatives. A $13 billion valuation on a company at the center of a price war represents a bet that the infrastructure providers outlast the commodity collapse. As Baseten's co-founders wrote in January, "As AI becomes embedded in every product and workflow, inference isn't just growing—it's becoming one of the largest markets ever created"

3

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved