Baseten Raises $1.5B at $13B Valuation for AI Inference

Baseten Secures $1.5 Billion in Dual-Tiered Funding Round

Baseten is finalizing a $1.5 billion funding round that values the AI inference platform at up to $13 billion, marking one of the most dramatic valuations in recent AI infrastructure history1

. The round features an unusual dual-tiered structure where some investors are buying shares at an $11 billion valuation while others are paying $13 billion, according to reports2

. This tactic, increasingly common among AI startups, allows companies to lift headline numbers while accommodating different investor risk appetites. The financing is co-led by Altimeter Capital, Conviction, Spark Capital, Sands Capital, and Wellington Management1

Source: PYMNTS

Explosive Growth Driven by Surging Demand for Inference

The Baseten funding represents a remarkable acceleration in valuation. The San Francisco-based company was valued at $5 billion as recently as January when it raised a $300 million Series E backed by IVP, CapitalG, and Nvidia1

. The new deal more than doubles that valuation in roughly five months. Just nine months earlier, in September 2025, Baseten was worth about $2.15 billion3

. This rapid re-rating reflects explosive revenue growth, with annualized revenue reaching around $600 million at the end of the first quarter, compared to $200 million at the beginning of the quarter3

Open-Source AI Models Reshape the AI Stack Economics

Baseten sells what founder and CEO Tuhin Srivastava describes as the unglamorous but essential part of the AI stack1

. Founded in 2019, the company provides the software and computing capacity businesses need to run AI model inference—the step where a trained model actually answers a query. Baseten rents capacity from around 20 cloud providers and layers its own inference software on top, allowing customers to deploy and fine-tuning models on their own data1

. Much of this work runs on cheaper open-source AI models rather than frontier systems from OpenAI or Anthropic. "Open-source models are getting very, very good," Srivastava told the Wall Street Journal, noting that customers increasingly mix open and proprietary models depending on task complexity1

. Some customers have cut costs sharply by shifting inference workloads to open-source options, with one reportedly running tasks at about 30 percent of the cost of proprietary alternatives1

Source: SiliconANGLE

Technical Infrastructure Powers Cost-Effective AI Solutions

The Baseten AI inference platform automates the complex workflow of setting up cloud-based inference clusters, eliminating the need for developers to manually provision graphics cards, configure them, and install numerous software tools2

. The platform is powered by three core inference engines optimized for different AI architectures. BIS-LLM handles LLMs with mixture of experts architecture by optimizing KV cache and automatically provisioning hardware as token usage increases. Engine-Builder-LLM targets dense LLMs using lookahead decoding to generate multiple tokens simultaneously. BEI serves simpler models including embedding, classification, and search models2

. A software module called MCM spreads workloads across multiple cloud providers, rerouting prompts during outages and addressing graphics card shortages. Customers can deploy custom algorithms using Truss, which automates packaging LLMs into Baseten-compatible formats2

The Picks-and-Shovels Bet Amid an AI Inference Price War

The raise positions Baseten at the center of an AI infrastructure gold rush where the layer making models run efficiently has become among the most sought-after in the industry1

. Customers including Cursor, Mercor, and OpenEvidence use Baseten to serve models1

. Yet the funding also arrives amid intensifying price competition. Many widely-used open-source models now originate from China, including DeepSeek and Moonshot AI, while even Nvidia has released an open family called Nemotron1

. With OpenAI reportedly weighing steep price cuts, that pressure threatens frontier labs while theoretically benefiting companies helping businesses run cheaper alternatives. A $13 billion valuation on a company at the center of a price war represents a bet that the infrastructure providers outlast the commodity collapse. As Baseten's co-founders wrote in January, "As AI becomes embedded in every product and workflow, inference isn't just growing—it's becoming one of the largest markets ever created"3

Baseten raises $1.5 billion at $13 billion valuation as AI inference demand explodes

Baseten Secures $1.5 Billion in Dual-Tiered Funding Round

Explosive Growth Driven by Surging Demand for Inference

Open-Source AI Models Reshape the AI Stack Economics

Technical Infrastructure Powers Cost-Effective AI Solutions

The Picks-and-Shovels Bet Amid an AI Inference Price War

References

Baseten raises $1.5bn at up to $13bn for AI inference

AI inference provider Baseten reportedly raising $1.5B in funding

Baseten Nears $1.5 Billion Funding Round as Inference Demand Surges | PYMNTS.com

Related Stories

Baseten Secures $150 Million in Series D Funding, Reaching $2.15 Billion Valuation

Baseten Secures $75M Funding to Boost AI Inference Performance and Efficiency

Nvidia invests $150 million in Baseten as AI industry shifts from training to inference

Recent Highlights

OpenAI releases GPT-5.6 models after government review, unveils ChatGPT Work to compete in AI agent race

Apple sues OpenAI over alleged trade secret theft as hardware rivalry intensifies

Meta's new AI image generator can create deepfakes from public Instagram photos without notice

Recent Highlights

Today's Top Stories

200+ Economists Warn AI Economic Impact Could Dwarf Industrial Revolution in Just Years

Waze integrates Google Gemini AI with personalized navigation and motorcycle-focused updates

Satya Nadella warns companies using AI are paying twice: once in cash, once in secrets

Samsung Health forces users to choose between AI training consent and losing their data