Enterprise AI finds its infrastructure home in private cloud as costs and security reshape strategies

2 Sources

Share

A major shift is underway in how enterprises deploy AI at scale. Broadcom's 2026 Private Cloud Outlook reveals that 56% of enterprises now run production AI inference in private cloud, up from 44% last year, while public cloud usage dropped from 56% to 41%. Rising costs, security concerns, and sovereignty requirements are driving the change as AI moves from pilot projects to production workloads.

Enterprise AI Infrastructure Undergoes Major Shift from Public Cloud to Private Cloud

The assumptions that guided early enterprise AI strategies are being rewritten at production scale. When organizations first built AI roadmaps, the default path pointed toward hyperscaler public clouds with ready APIs and expanding GPU capacity

1

. But Broadcom's Private Cloud Outlook 2026 report, based on a survey of 1,800 senior IT leaders across eight countries, reveals a dramatic reversal now showing up in production workloads and capital budgets.

Last year, 56 percent of enterprises used public cloud as the primary environment for production AI inference. This year, that figure has fallen 15 percentage points to 41 percent, while 56 percent of enterprises are now running or planning to run production AI workloads in a private cloud environment

1

. The shift marks a tipping point where AI infrastructure decisions are being driven by the same forces that previously pulled storage, security-sensitive applications, and regulated data into private environments.

AI Infrastructure Costs Drive Urgent Recalculation of Investment Strategies

For the first time in Broadcom's study, cost has overtaken security as the top concern about public cloud

1

. The numbers tell a stark story: 97 percent of IT leaders believe some portion of their public cloud spend is wasted, and more than half say that waste exceeds 25 percent of their total spending. Generative AI and agentic workloads are compounding the pressure, with 62 percent of IT leaders reporting they are very or extremely concerned about AI infrastructure costs

1

.

Source: SiliconANGLE

Source: SiliconANGLE

"Companies are suddenly seeing the cloud answer to AI is not the right answer to run production AI at scale, and inferencing AI, which is really the runtime of your day-to-day operations," explained Paul Turner, chief product officer of the VMware Cloud Foundation Division at Broadcom

2

. "You will not run that on the cloud, because of the cost of operations. You can run it more efficiently, just like you can the rest of your infrastructure, on a private cloud environment."

Enterprises are revising AI investment strategies accordingly. Net intent to increase private cloud investment over three years has risen from 51 percent to 72 percent, and private cloud investment is now growing at more than twice the rate of public cloud

1

. Cost predictability has become the second biggest driver of that shift, cited by 39 percent of organizations. Broadcom's VMware Cloud Foundation platform has already attracted more than 2,000 customers seeking secure and cost-effective AI deployments

2

.

Repatriation Accelerates as AI Workloads Return from Public Cloud

The broader repatriation trend has accelerated sharply: 83 percent of enterprises are now considering repatriation, up from 69 percent in 2025, and half have already moved at least some workloads, a 15-point jump in a single year

1

. More significantly, 43 percent of enterprises actively repatriating workloads are moving AI training, large language models, and inference out of the public cloud—a category that did not exist in last year's study

1

.

When IT leaders place workloads, those classified as high-security, latency sensitive, business critical, or data-intensive consistently land in private cloud. The forces driving this shift—security, control, cost, and governance—did not become more important because of AI, but the consequences of getting them wrong became much harder to absorb at production scale

1

.

AI Sovereignty Emerges as Board-Level Priority Amid Geopolitical Pressures

Geopolitics has moved squarely into the infrastructure conversation. Eighty-six percent of IT leaders say geopolitical and regulatory factors are now directly affecting their IT strategy and operations

1

. Data sovereignty and residency requirements are the top concern, cited by 54 percent of respondents, followed by jurisdiction-specific compliance requirements at 51 percent.

The scope of AI sovereignty has evolved rapidly. Chris Wolf, global head of AI and advanced services for the VMware Cloud Foundation Division at Broadcom, noted that while organizations a year ago focused primarily on filtering data shared with frontier models, today's decisions around data privacy, access control, and auditability are far more nuanced

2

. "For a lot of our customers today, their definition means that it's not just about the data plane being sovereign, it's about the control plane being sovereign," Wolf explained

2

.

Security and compliance remain the single most important factor in workload placement decisions, cited by 32 percent of respondents. AI is adding new obligations on top of existing ones: data protection and privacy (37 percent) and security and control (36 percent) are now the leading infrastructure requirements that AI imposes

1

. Private cloud provides the governance architecture to meet those requirements by design, built in from the start rather than bolted on after deployment.

Security Risks Multiply as AI-Driven Attacks Escalate

"AI is also a risk and cost multiplier," Turner warned. "Just think about a few stats: 73% of enterprises see AI-related attacks. That is almost every industry out there ... actually seeing these new attacks that are driven by AI-enabled software"

2

. The rise in AI-driven attacks is forcing enterprises to reconsider where sensitive workloads should run, with private cloud offering greater control over security posture.

Skills Gap Threatens Production AI Operations

Running production AI at enterprise scale presents an operations challenge as significant as the infrastructure one. The top skills gap cited by IT leaders is AI infrastructure and operations, named by 40 percent of respondents, followed by cloud security operations at 38 percent and Kubernetes operations at 37 percent

1

. This skills gap will shape how quickly organizations can deploy and manage AI workloads at scale, regardless of infrastructure choice.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved