AMD Unveils Instinct MI350 Series: A Powerhouse in AI Acceleration

AMD's Instinct MI350 series pairs a 3D chiplet design with TSMC's N3P process, and AMD claims performance gains over both its predecessor and NVIDIA's B200.

AMD's Technological Marvel: The Instinct MI350 Series

AMD has unveiled its latest AI accelerator series, the Instinct MI350, at Hot Chips 2025, showcasing a significant leap in AI hardware capabilities. This new series, based on the CDNA 4 architecture, is designed to meet the growing demands of large language models (LLMs) and complex AI workloads [1][2].

Advanced Architecture and Manufacturing

Source: Wccftech

The Instinct MI350 series leverages cutting-edge technology, featuring a 3D Multi-Chiplet layout with 185 billion transistors. AMD has employed a dual-process approach, utilizing TSMC's N3P (3nm) process for the Accelerator Complex Dies (XCDs) and the N6 (6nm) process for the I/O Base Die (IOD). This combination, along with TSMC's CoWoS-S advanced packaging, allows for optimal performance and cost-effectiveness [1][2].

Memory and Bandwidth Innovations

A standout feature of the MI350 series is its memory subsystem. The flagship MI355X model offers up to 288GB of HBM3E memory, with eight 12-Hi stacks each providing 36GB at 8Gbps. This configuration delivers a total bandwidth of 8TB/sec, which AMD puts at 1.1x that of NVIDIA's B200 [1].
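The capacity and bandwidth figures above follow from simple per-stack arithmetic. The sketch below reproduces them, assuming the standard 1024-bit interface per HBM3E stack (the article does not state the bus width) and reading "8Gbps" as the per-pin data rate. The function names are illustrative only.

```python
# Figures taken from the article.
STACKS = 8            # HBM3E stacks on the MI355X
GB_PER_STACK = 36     # capacity of each 12-Hi stack
PIN_RATE_GBPS = 8     # data rate per pin, in gigabits per second

# Assumption: each HBM3E stack exposes a 1024-bit interface.
BUS_WIDTH_BITS = 1024


def hbm_capacity_gb(stacks, gb_per_stack):
    """Total capacity in GB across all stacks."""
    return stacks * gb_per_stack


def hbm_bandwidth_gb_per_s(stacks, pin_rate_gbps, bus_width_bits):
    """Aggregate bandwidth in GB/sec: pins x rate per pin, converted from bits to bytes."""
    return stacks * pin_rate_gbps * bus_width_bits / 8


print(hbm_capacity_gb(STACKS, GB_PER_STACK))
print(hbm_bandwidth_gb_per_s(STACKS, PIN_RATE_GBPS, BUS_WIDTH_BITS))
```

Under these assumptions, eight stacks give 288GB and 8,192GB/sec, which rounds to the quoted 8TB/sec.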

The series also incorporates 256MB of AMD Infinity Cache and utilizes Infinity Fabric Advanced Package (AP) interconnect, providing 5.5TB/sec of bi-directional bandwidth between chiplets [1].

Performance and Efficiency Enhancements

AMD has made substantial improvements in both performance and efficiency:

  1. The MI355X can reach clock speeds of up to 2.4GHz with a 1400W TBP (Total Board Power) in its liquid-cooled variant [1][2].
  2. The series supports new data formats, including full-access FP8 and industry-standard micro-scaled MXFP6 and MXFP4, enabling faster AI training and inference [2].
  3. Compared to its predecessor, the MI300X, the MI355X shows significant performance gains across various data formats, including a 1.8x increase in Matrix FP16/BF16 and a 1.9x increase in Matrix FP8 [2].
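To make "micro-scaled" concrete: in the MX formats, a small block of values shares one power-of-two scale, and each element is stored in a very narrow float. The following is a minimal sketch of that idea for MXFP4, not AMD's hardware implementation. It assumes the block size (32), the E2M1 element values, and the shared-exponent rule as I understand them from the OCP Microscaling specification. The nearest-value rounding, the saturation at 6.0, and all function names are simplifications chosen for illustration.

```python
import math

# Magnitudes representable by a 4-bit E2M1 element (sign is stored separately).
FP4_E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
BLOCK_SIZE = 32   # assumed number of elements sharing one scale
E2M1_EMAX = 2     # exponent of the largest power of two in E2M1 (4.0 == 2**2)


def quantize_block_mxfp4(block):
    """Return (shared_exponent, quantized_elements) for one block of floats."""
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return 0, [0.0] * len(block)
    # Pick a power-of-two scale so the largest element lands near the top of the FP4 range.
    shared_exp = math.floor(math.log2(max_abs)) - E2M1_EMAX
    scale = 2.0 ** shared_exp
    quantized = []
    for x in block:
        # Scale down, saturate at the largest FP4 magnitude, then snap to the nearest value.
        magnitude = min(abs(x) / scale, FP4_E2M1_VALUES[-1])
        nearest = min(FP4_E2M1_VALUES, key=lambda v: abs(v - magnitude))
        quantized.append(math.copysign(nearest, x))
    return shared_exp, quantized


def dequantize_block(shared_exp, quantized):
    """Reconstruct approximate floats from the shared exponent and FP4 elements."""
    return [q * 2.0 ** shared_exp for q in quantized]


exp, q = quantize_block_mxfp4([8.0, 1.0, -3.0, 0.0])
print(exp, q, dequantize_block(exp, q))
```

Each element costs 4 bits plus a share of the block's scale, which is why these formats reduce memory traffic relative to FP8 or FP16 at the cost of precision.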

Competitive Edge

Source: TweakTown

AMD claims that the Instinct MI355X outperforms NVIDIA's B200 in several key areas [1]:

  • 1.6x higher memory capacity
  • 1.1x higher bandwidth
  • 2.1x higher FP64 performance
  • 1.1x higher FP16, FP8, and FP4 performance
  • 2.2x higher FP6 performance

Flexible Configuration and Scalability

The MI350 series supports flexible GPU partitioning, allowing for various configurations of the XCDs. This flexibility enables the chip to support up to 8 instances of 70B models in CPX+NPS2 AI workloads, demonstrating its capability to handle large-scale AI tasks [1][2].
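A rough way to reason about the "8 instances of 70B models" figure is to compare each instance's share of memory with the model's weight footprint. The sketch below assumes the 288GB of HBM3E is divided evenly across the 8 instances and counts weights only, ignoring KV cache, activations, and any per-block scale overhead. The article does not say which precision the models use, so the bit widths tried here are illustrative, as are the function names.

```python
TOTAL_HBM_GB = 288     # from the article
INSTANCES = 8          # from the article
MODEL_PARAMS = 70e9    # "70B" parameters


def memory_per_instance_gb(total_gb, instances):
    """Even split of device memory across instances (an assumption)."""
    return total_gb / instances


def weight_footprint_gb(params, bits_per_weight):
    """Weights-only footprint in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


def fits(params, bits_per_weight, budget_gb):
    """True if the weights alone fit within the per-instance budget."""
    return weight_footprint_gb(params, bits_per_weight) <= budget_gb


budget = memory_per_instance_gb(TOTAL_HBM_GB, INSTANCES)
for bits in (16, 8, 6, 4):
    size = weight_footprint_gb(MODEL_PARAMS, bits)
    print(f"{bits}-bit weights: {size:.1f} GB, fits in {budget:.0f} GB: {fits(MODEL_PARAMS, bits, budget)}")
```

Under this even-split assumption each instance gets 36GB, so only the 4-bit case (35GB of weights) fits, and with very little headroom. That is consistent with the series' emphasis on low-precision formats, though the article does not confirm how the 8-instance figure was obtained.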

Impact on AI and HPC Landscape

With its advanced features and performance capabilities, the AMD Instinct MI350 series is poised to make a significant impact in the AI and high-performance computing (HPC) sectors. Its ability to handle larger models with improved efficiency addresses the growing demands of AI researchers and enterprises working with increasingly complex AI systems [2].

As the AI hardware race intensifies, AMD's latest offering demonstrates the company's commitment to innovation and its ability to compete at the highest levels of the AI accelerator market. The Instinct MI350 series represents a crucial step forward in enabling more powerful and efficient AI computations, potentially accelerating advancements across various AI applications and research fields.
