Production AI reshapes private cloud as Broadcom debuts VMware Cloud Foundation 9.1

2 Sources

Share

Broadcom launches VMware Cloud Foundation 9.1, targeting production AI workloads with up to 40% server cost reduction and enhanced security. The move reflects a broader shift as enterprises bring AI closer to enterprise data, with 56% now running or planning production inference on private cloud—while public cloud use drops 15% year-over-year.

Production AI Shifts Infrastructure Priorities Back to Private Cloud

Broadcom has unveiled VMware Cloud Foundation 9.1, positioning private cloud as the practical foundation for secure and cost-effective production AI

2

. The release comes as enterprises move AI workloads from experimental pilots to operational systems, fundamentally changing how organizations evaluate infrastructure for AI workloads

1

. This shift is not just about where workloads run—it centers on cost control, security, governance, and the strategic need to bring AI closer to enterprise data

1

.

Source: CXOToday

Source: CXOToday

According to Broadcom's Private Cloud Outlook 2026 report, 56% of organizations are running or planning to run production inference on private cloud, while public cloud use for production inference dropped to 41%, down 15% year-over-year

2

. The data reveals a practical recalibration: while training and experimentation may still lean on cloud services, inference at scale has different economics. Once inference becomes part of operations, token costs, data gravity, and compliance become boardroom issues rather than technical footnotes.

Generative AI Infrastructure Costs Drive Platform Rethinking

Cost pressure is making infrastructure efficiency a direct lever for AI adoption. Broadcom reports that 62% of IT leaders are very or extremely concerned about generative AI infrastructure costs, while 36% report AI is driving new requirements for data protection, privacy, security controls, and risk management

2

. "AI is also a risk and cost multiplier," said Paul Turner, chief product officer of the VMware Cloud Foundation Division at Broadcom. "Just think about a few stats: 73% of enterprises see AI-related attacks"

1

.

VMware Cloud Foundation 9.1 addresses these concerns with measurable infrastructure improvements. The AI and Kubernetes-native private cloud platform delivers up to 40% reduction in server costs through intelligent resource optimization and advanced memory tiering for clusters running mixed AI and non-AI workloads

2

. Storage total cost of ownership drops up to 39% through enhanced compression and deduplication for AI data pipelines, while Kubernetes operational costs for running AI workloads at scale decrease up to 46%

2

.

Multi-Tenant Infrastructure and Hardware Flexibility for Production Workloads

The platform integrates multi-tenant infrastructure for AI isolation, enabling enterprises and service providers to run multiple AI projects on shared infrastructure with strict security boundaries

2

. This maximizes utilization of expensive GPU and CPU resources while supporting data sovereignty for sensitive models. VCF 9.1 provides mixed compute infrastructure support across AMD, Intel, and NVIDIA, giving enterprises freedom to choose best-of-breed GPU and CPU hardware

2

.

Source: SiliconANGLE

Source: SiliconANGLE

The platform also delivers 4x faster cluster upgrades and 2x increased fleet capacity, now managing up to 5,000 hosts, to rapidly scale AI infrastructure across distributed and air-gapped environments

2

. High-speed networking for AI workloads is enabled through support for NVIDIA ConnectX-7 NICs and NVIDIA BlueField-3 with Enhanced DirectPath I/O, crucial for demanding generative AI workloads

2

.

AI Sovereignty Becomes Deployment Requirement

For regulated environments, AI sovereignty has evolved from abstract policy debate to concrete deployment requirement. Chris Wolf, global head of AI and advanced services for the VMware Cloud Foundation Division at Broadcom, explained that enterprise definitions now extend beyond data plane sovereignty: "It's about the control plane being sovereign. It's that, 'I can disconnect from the internet and I can continue to run. I can continue to operate'"

1

. This governance model matters as organizations deploy local models, frontier models, and AI gateways together depending on sensitivity, cost, and performance requirements.

Prashanth Shenoy, chief marketing officer of the VMware Cloud Foundation Division at Broadcom, noted the shift in enterprise priorities: "Last year, when we did the private cloud outlook study, there was a definitive cloud reset happening in the market, where private cloud and the operating model of private cloud to run your mission-critical workload on-premises or in a hybrid environment was on par with public cloud. Fast-forward to this year, when we did the same survey with 1,800 IT leaders and decision-makers around the globe. A lot of organizations are now moving their AI applications from a pilot phase of trying, to production, doing it at scale"

1

. This transition signals that enterprises are watching infrastructure efficiency, security boundaries, and operational control as key indicators for sustainable AI deployment at enterprise scale.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved