Perplexity partners with CoreWeave for AI inference on Nvidia-powered infrastructure


Perplexity has signed a multi-year strategic partnership with CoreWeave to power its AI inference workloads using dedicated Nvidia Grace Blackwell clusters. The deal helps CoreWeave show that it can attract a wider range of AI customers who want more than raw computing capacity. CoreWeave stock rose 4% following the announcement.

Perplexity Secures Multi-Year Strategic Partnership with CoreWeave

Perplexity has entered into a multi-year strategic partnership with CoreWeave to handle its growing AI inference workloads on the specialized AI cloud provider's platform [1]. The agreement marks a significant move for both companies as demand for computing power continues to surge across the AI industry. Under the deal, Perplexity will leverage dedicated Nvidia-powered clusters, specifically NVIDIA GB200 NVL72 systems based on the Grace Blackwell architecture, to run its next-generation inference operations [2]. The partnership aims to support Perplexity's Sonar and Search API ecosystem as usage expands, reflecting the company's multi-cloud strategy to ensure optimal performance and reliability.

Source: Axios


CoreWeave Stock Gains as Company Diversifies Customer Base

CoreWeave stock rose 4% Wednesday following the partnership announcement, signaling investor confidence in the company's ability to attract a broader range of AI customers [2]. CEO Mike Intrator emphasized that the deal reflects "a wider mix of emerging AI leaders adopting the CoreWeave platform," noting that customers choose the company for its unified AI cloud platform rather than just access to raw capacity [1]. This distinction matters as CoreWeave aims to convince Wall Street that a diversified business model justifies its heavy spending on new data centers. The company operates the only AI cloud to have earned the top Platinum ranking in both SemiAnalysis ClusterMAX 1.0 and 2.0, which evaluate cloud infrastructure performance, efficiency, and reliability [2].

Performance-Driven Infrastructure Decision for AI Inference Workloads

Perplexity's decision to partner with CoreWeave was driven primarily by performance considerations. "Every infrastructure decision traces back to one question: Does this make Perplexity better for our users?" said Dmitry Shevelenko, Perplexity's chief business officer [1]. Perplexity has already begun running AI inference workloads using CoreWeave Kubernetes Service as part of the initial deployment phase, while also utilizing W&B Models for model training, fine-tuning, and management from experimentation to production [2]. Max Hjelm, senior vice president of revenue at CoreWeave, noted that "AI applications running in production require more than just access to raw infrastructure - they require best-in-class performance and reliability as well as a cloud platform designed end-to-end for AI that simplifies compute operations."

CoreWeave Adopts Perplexity Enterprise Max for Internal Operations

In a reciprocal arrangement, CoreWeave will deploy Perplexity Enterprise Max across its organization, enabling employees to search the web and internal knowledge bases, conduct research, visualize data, and access generative AI models within one unified platform [1]. This adoption demonstrates confidence in Perplexity's enterprise capabilities and creates a feedback loop in which CoreWeave can directly experience the performance of the AI workloads it hosts. The collaboration positions both companies to capitalize on the surging demand for high-performance computing in AI applications.

Specialized AI Cloud Infrastructure Meets Growing Demand

CoreWeave's GPU-first architecture, optimized for training and inference of AI models, has positioned the company as a niche player in a market dominated by generalist cloud giants [3]. With its own data centers located in the United States and Europe, CoreWeave maintains full control over its cloud infrastructure, enabling high performance, low latency, and flexible deployment capabilities. The company's vertical integration from hardware to software, including proprietary GPU management tools for intelligent resource allocation and performance optimization, enhances its competitive position. As CoreWeave tries to meet tremendous demand for AI computing without overbuilding, partnerships like the one with Perplexity help validate its specialized approach to serving production AI systems at scale.


© 2026 Triveous Technologies Private Limited