
2 Sources
2 Sources
[1]

High-performance computing innovations: Key insights from SC24 - SiliconANGLE
High-performance computing innovations are redefining the future of enterprise computing, pushing the boundaries of scalability, sustainability and innovation. At the heart of this transformation is the emergence of scalable AI infrastructure, which is democratizing supercomputing and making advanced technologies accessible to enterprises of all sizes, according to John Furrier, executive analyst at theCUBE Research. "I think this year you're starting to see real build-out around the infrastructure hardware and where hardware is turning into systems," Furrier said in during the recent SC24 event. "You're going to start to see the game change, and then the era's here, the chapter's closed, the old IT is over and the new systems are coming in." Furrier and fellow theCUBE Research analysts Dave Vellante and Savannah Peterson spoke with tech leaders in AI and high-performance computing at SC24, during an exclusive broadcast on theCUBE, SiliconANGLE Media's livestreaming studio. Discussions centered on how AI-driven innovation shapes scalable infrastructure, sustainability practices and quantum computing's future role in data center architectures. (* Disclosure below.) Here are three key insights you may have missed from theCUBE's coverage: As enterprises adopt flexible, open systems, collaborations across the tech industry address the challenges of power consumption and cost. Partnerships such as those between Super Micro Computer Inc. and WekaIO Inc. exemplify high-performance computing innovations, pioneering energy-efficient AI data centers. These collaborations ensure sustainability remains a core principle of scalability, according to Nilesh Patel, chief product officer at Weka; Patrick Chiu, senior director of storage product management at Supermicro; and Ian Finder, group product manager of accelerated computing at Nvidia Corp. "As we continue to see the build-out [of AI data centers], two challenges are happening," Patel told theCUBE during the event. "One is the power consumption; the power requirement in data centers is growing like crazy. The second thing is now we are getting into influencing space where it's becoming a token economy. The cost token for dollars, tokens per wattage use and so on ... have become our important key performance indicators. We got together with Nvidia and Supermicro and tried to attack one of the core problems that is becoming the Achilles heel for data center growth, particularly for AI infrastructure." AI's exponential growth has pushed traditional computing frameworks to their limits, making clustered systems essential for scaling modern workloads, according to Hasan Siraj, head of software products, ecosystem, at Broadcom Inc. Networking advances reflect high-performance computing innovations, serving as the glue connecting these clusters, enabling efficient training of large language models while addressing latency and bandwidth challenges. "If you are training a large model and these models are growing at an exponential, they don't fit in a central processing unit, and a core of a CPU, virtualization is no play," Siraj said during the event. "This is why you cannot fit a model within a server or two servers or four servers. That is why you need a cluster. When you have a cluster and everything is spread out, you need glue to put this all together. That is networking." Building on scalable clusters, open hardware solutions provide enterprises the flexibility to tailor infrastructure to diverse workloads. These systems break free from proprietary lock-in, delivering cost-effective options for scaling AI operations while optimizing resource usage, according to Steen Graham, chief executive officer of Metrum AI Inc. and Manya Rastogi, technical marketing engineer at Dell. "I think right now with AI, we've really kind of optimized software in a great way," Graham said during an interview at the event. "We're building this really systematic software with AI workers that will save people material [and] time and ultimately drive topline revenue and getting enterprises to really high-fidelity solutions." Here's the complete video interview with Patrick Chiu, Nilesh Patel and Ian Finder: The evolution of artificial intelligence demands modular systems that prioritize efficiency, flexibility and scalability. Broadcom, Dell and Denvr Dataworks Inc. exemplify this approach with AI factories designed for compact, energy-efficient operations. These modular superclusters integrate over 1,000 GPUs in under 900 square feet, leveraging advanced liquid immersion cooling to optimize power usage and space, according to Broadcom's Hasan Siraj; Vaishali Ghiya, executive officer of global ecosystems and partnerships at Denvr Dataworks; and Arun Narayanan, senior vice president of compute and networking product management at Dell. "AI workloads are very power-hungry," Ghiya told theCUBE during the event. "That is exactly why we designed our Denvr Dataworks private zone, in partnership with Broadcom and Dell, so that we can give customers different choices and options as well as open architecture. Liquid immersion cooling, as well as liquid to the chip cooling, really results in the efficient power usage as well as a compact footprint." Decentralization further reshapes enterprise AI infrastructure, providing sustainable alternatives that challenge traditional hardware dependency. Organizations can optimize their hardware setups by embracing multi-vendor ecosystems with diverse solutions, such as Advanced Micro Devices Inc. GPUs. These integrations enable high-performance computing innovations for customized AI workloads while fostering innovation, according to Saurabh Kapoor, director of product management and strategy at Dell Technologies, and Jon Stevens, chief executive officer of Hot Aisle Inc. "The thing that I think that we're going to focus on is just continuously releasing whatever's [the] latest and greatest, working with Dell, working with AMD [and] working with Broadcom to continuously make this latest and greatest hardware available to developers, to anyone, and support them with that," Stevens told theCUBE during the event. Data intelligence underpins the success of these modular systems, transforming raw data into actionable insights that drive scalability. By ingesting, analyzing and delivering insights across diverse data types, platforms such as DataDirect Networks Inc. enhance AI performance and adapt to evolving business needs, according to Alex Bouzari, co-founder and chief executive officer of DDN. "The industry is completely transforming -- it's all about AI," Bouzari told theCUBE during the event. "You have to be able to ingest the data, images, audio, text [and] video from lots of different sources. You have to be able to analyze it, process it, gain insight from it and then deliver that insight to organizations who will then benefit from it. And we are at the core of it. We are the data intelligence platform that propels the growth of AI across industries and marketing." Here's the complete video interview with Alex Bouzari: Direct liquid cooling has emerged as a critical solution to manage the intense heat generated by powerful CPUs and GPUs, maintaining performance at scale while supporting high-performance computing innovations. The rise of exascale computing redefines the boundaries of high-performance computing, enabling massive data processing with unparalleled efficiency. However, this progress introduces significant challenges in thermal management, necessitating advanced cooling technologies, according to Armando Acosta (pictured), director of HPC product management at Dell. Direct liquid cooling has emerged as a critical solution to manage the intense heat generated by powerful CPUs and GPUs, maintaining performance at scale while supporting high-performance computing innovations. "If you look at the rise of exascale, what you're starting to see now is with the rise of exascale and these large machines and HPC supercomputers, guess what? New challenges arise when you try to go to that scale," Acosta said during the event. "When you look at exascale, what it's driving is more direct liquid cooling technologies. If you want the highest performance, you want the best CPU or the highest performing GPU ... you have to do direct liquid cooling." As artificial intelligence workloads expand, networking infrastructure must evolve to support high throughput and low-latency demands. Unlike traditional data centers, AI architectures require clusters of GPUs functioning cohesively as a single computational unit. This integration unlocks the potential of AI operations while driving high-performance computing innovations that deliver both efficiency and business value, according to Scott Bils, vice president of product management, professional services, at Dell. "The key to driving outcomes and business value from gen AI is data," he said during the event. "That's where the role of AI networking becomes so critical. When you think about AI networking and the role it plays in data, when you think about clusters and AI architectures, they're fundamentally different than traditional data center networking. When you think about clusters of GPUs, you essentially want the clusters at a rack level, or even a data center level, to function as a single computer ... a single brain." To sustain long-term AI scalability, organizations must address growing demands on energy and infrastructure through tailored solutions, Bils noted. Automating data pipelines and employing AI-specific data catalogs improve performance and sustainability by streamlining access and ensuring compliance. "As enterprise deployments begin to scale out, they're going to face and are facing similar issues," Bils said. "Helping them think through the overall design architecture, not just for today, but going forward as they scale out the environment, is a big part of the capability we bring -- then, the expertise from Nvidia and our other partners in the space as well." Here's the complete interview with Armando Acosta: To watch more of theCUBE's coverage of SC24, here's our complete event video playlist:
[2]

Multicloud AI strategies and security innovations at Ignite 2024 - SiliconANGLE
Multicloud AI strategies are revolutionizing how organizations approach secure, scalable and integrated artificial intelligence solutions across complex technology landscapes. As businesses prioritize seamless connectivity and robust data access, industry leaders at Microsoft Ignite 2024 spotlighted innovative approaches to simplifying AI deployment and enhancing productivity. Simplifying connectivity has become a critical focus due to the growing need for consistent access to data and applications in multicloud environments, according to theCUBE Research's Rob Strechay and Bob Laliberte. "You're starting to see a greater awareness that the network is not just basic plumbing," Laliberte said during an analyst segment at Microsoft Ignite. "It's required for these highly distributed environments and especially when you're going to cloud and multicloud environments. That connectivity is going to be critical to ensure that all the data gets to where it needs to go and that people can access the information they need to access." Strechay and Laliberte spoke with industry-leading technology and cloud innovators at Microsoft Ignite 2024, during an exclusive broadcast on theCUBE, SiliconANGLE Media's livestreaming studio. Discussions explored advancements in AI simplification, strategies enabling teams to independently leverage AI and upskilling initiatives designed to enhance productivity and adoption. (* Disclosure below.) Here are three key insights you may have missed from theCUBE's coverage: As AI and hybrid cloud infrastructures evolve, multicloud AI strategies remain a top priority for enterprises navigating complex multicloud environments. Dell Technologies Inc. has emphasized zero-trust security measures with its APEX Protection Services for Microsoft Azure, integrating data immutability, encryption and multi-factor authentication to safeguard customer data. These measures offer resilience alongside robust protection, according to Varun Chhabra (pictured), senior vice president of product marketing at Dell. "It delivers enterprise-grade protection for customers and their data, but it has a unique focus on resilience," he told theCUBE during the event. "With APEX Protection Services from Microsoft Azure, what customers get is secure storage. There are multiple layers of zero-trust security built into it, data immutability, encryption, multi-factor authentication and role-based access controls." Aviatrix Systems Inc. contributes to multicloud AI strategies by enhancing security measures tailored for AI workflows, according to Chris McHenry, senior vice president of product marketing at Aviatrix. The company provides customers with streamlined operations and robust security for AI applications in multicloud landscapes through integrations with Microsoft Security Copilot and Secure Service Edge. "Azure is going to be changing the way that applications access the internet," McHenry said during the event. "Organizations are going to have to make a choice as to how they implement internet access. They call this the default egress change. We think that we have one of the best options out there." Here's theCUBE's complete video interview with Varun Chhabra: Partnerships between cloud providers and technology leaders are unlocking new possibilities for AI-driven hybrid cloud infrastructures. Nutanix Inc. has partnered with Microsoft Azure to create a hybrid AI infrastructure that enables seamless operation across on-premises and cloud environments. This partnership delivers flexibility for industries such as healthcare and finance by supporting multicloud AI strategies that operate efficiently from edge locations to virtual private data centers, according to Lee Caswell, senior vice president of product and solutions marketing at Nutanix. "Apps are now the competitive currency," he said during the event. "That's part of the Nutanix Enterprise AI solution. With that, you get your choice of [large language models] fully certified down to [graphics processing unit]-enabled servers from all of our top partners. Now you can go and run that right across the full hybrid experience from the edge into the virtual private data center and then now connect it into Microsoft Azure." Similarly, Hitachi Vantara LLC is advancing its collaboration with Microsoft to deliver an AI-ready platform for hybrid cloud environments, according to Rollen Roberson, vice president of hyperscalers and cloud at Hitachi Vantara. The Hitachi Unified Compute Platform for Azure Stack HCI enhances cloud management capabilities while supporting AI application development in regulated industries. This effort aims to balance on-premises performance with cloud scalability. "AI is a very customized type of workload that is specific to the business that our customers are in," Roberson said during the event. "To help with that journey, we've teamed up with our sister company, Hitachi Solutions ... one of Microsoft's premier services companies that helps build many of the applications and products you see today on Azure. We're providing the underlying data fabric, storage capability, compute [and] a lot of the weight that data brings." Here's theCUBE's complete video interview with Lee Caswell: Emerging technologies, such as quantum computing, artificial intelligence and next-generation networking, are transforming enterprise operations, according to theCUBE's Strechay and Laliberte. Key innovations, such as hollow-core fiber, are critical in reducing latency and meeting the increasing demands of AI-driven data flows, making them essential for enterprise scalability. "Hollow-core fiber essentially enables organizations to carry more capacity over greater distances with a lower latency," Laliberte said in an analyst segment during the event. "It just seems like this is a technology that everyone is going to be implementing. Microsoft is saying, 'Hey, this is a great application for it in those backend AI data centers,' and not only within the data center, but across data centers and being able to stretch that out." As enterprises adopt multicloud strategies, smooth integration and operational efficiency have become priorities, according to Kambiz Aghili, vice president of Oracle cloud infrastructure at Oracle Corp. Oracle Cloud Infrastructure addresses these needs by providing cross-cloud redundancy and interoperability, enabling businesses to access Oracle and non-Oracle data states wherever needed. "Customers want access to Oracle and non-Oracle data states wherever they need them, and right next to one another, they want to be able to build new applications that benefit them," Aghili told theCUBE during the event. "They want to be able to use relational data, non-relational machine learning graph, text spatial operations right next to one another with the same [structured query language and] REST API as they are used to." OCI simplifies AI deployment and supports multicloud AI strategies through integration capabilities, 24/7 availability and comprehensive data governance. Eliminating complexities such as patching and maintenance helps enterprises scale efficiently while ensuring compliance, according to Aghili. "Customers do not want to deal with patching and maintenance of getting from version to version," he said. "So, making these online services -- from online security to patching -- for customers that really need 24/7 availability, when you put that in conjunction with the data governance and the compliance that they really need us to meet on their behalf, making that absolutely a checkbox has been super successful and well-received with the customers today." Here's theCUBE's complete video interview with Kambiz Aghili: To watch more of theCUBE's coverage of the event, here's our complete event video playlist:
Share
Share
Copy Link
A comprehensive look at the latest advancements in high-performance computing and multicloud AI strategies, highlighting key insights from SC24 and Microsoft Ignite 2024 events.

High-performance computing (HPC) is undergoing a significant transformation, driven by the emergence of scalable AI infrastructure. This shift is democratizing supercomputing capabilities, making advanced technologies accessible to enterprises of all sizes
1
. John Furrier, executive analyst at theCUBE Research, notes that the industry is witnessing a real build-out of infrastructure hardware, signaling the end of traditional IT and the advent of new systems1
.The exponential growth of AI has pushed traditional computing frameworks to their limits, necessitating clustered systems for scaling modern workloads. Hasan Siraj from Broadcom Inc. emphasizes that networking advances serve as the crucial "glue" connecting these clusters, enabling efficient training of large language models while addressing latency and bandwidth challenges
1
.As AI adoption accelerates, power consumption and cost have become significant challenges. Collaborations between companies like Super Micro Computer Inc., WekaIO Inc., and Nvidia Corp. are pioneering energy-efficient AI data centers. Nilesh Patel from Weka highlights the importance of addressing power requirements and cost efficiency in data center growth
1
.The evolution of AI demands modular systems that prioritize efficiency, flexibility, and scalability. Companies like Broadcom, Dell, and Denvr Dataworks are developing AI factories designed for compact, energy-efficient operations. These modular superclusters integrate over 1,000 GPUs in under 900 square feet, leveraging advanced liquid immersion cooling to optimize power usage and space
1
.Multicloud AI strategies are revolutionizing how organizations approach secure, scalable, and integrated AI solutions across complex technology landscapes. At Microsoft Ignite 2024, industry leaders spotlighted innovative approaches to simplifying AI deployment and enhancing productivity
2
.Related Stories
Dell Technologies has emphasized zero-trust security measures with its APEX Protection Services for Microsoft Azure. This service integrates data immutability, encryption, and multi-factor authentication to safeguard customer data, offering resilience alongside robust protection
2
.Collaborations between cloud providers and technology leaders are unlocking new possibilities for AI-driven hybrid cloud infrastructures. Nutanix Inc. has partnered with Microsoft Azure to create a hybrid AI infrastructure that enables seamless operation across on-premises and cloud environments. This partnership delivers flexibility for industries such as healthcare and finance
2
.Quantum computing, artificial intelligence, and next-generation networking are transforming enterprise operations. Innovations such as hollow-core fiber are critical in reducing latency and meeting the increasing demands of AI-driven data flows, making them essential for enterprise scalability
2
.As the landscape of enterprise computing continues to evolve, these advancements in high-performance computing and multicloud AI strategies are set to play a crucial role in shaping the future of technology and business operations.
Summarized by

Navi