3 Sources
[1]
NVIDIA Vera CPU Benchmarks: Olympus Cores Delivering The Best Performance Ever Seen On ARM Review
NVIDIA's Vera data center CPU isn't ramping up until later this year but I recently had the opportunity to try out this new ARM-based CPU designed for agentic AI workloads. NVIDIA's Vera CPU with its in-house-designed Olympus CPU cores ends up packing a heavy-hitting punch with competitiveness to Intel/AMD x86_64 CPUs that I have never seen out of any other ARM or non-x86_64 processors. Continue on with these early benchmarks of the NVIDIA Vera CPU on Linux. Vera is NVIDIA's next-gen data center CPU designed for agentic AI and similar modern data center workloads. Vera most notably will be found with NVIDIA NVL72 Vera Rubin as the host CPU for powering these powerful racks for AI while it will also be found standalone for CPU racks. Unlike NVIDIA's Grace that uses Arm Neoverse-V2 cores, Vera makes use of NVIDIA's in-house "Olympus" core design. Vera features 88 Olympus cores that claim to deliver 2x the performance of its predecessor with leading energy efficiency too. Olympus is compatible with the Armv9.2 ISA and supports FP8 precision, 176 threads in total via spatial multi-threading, and is paired with LPDDR5X memory for delivering up to 1.2TB/s of memory bandwidth. Over Grace, Vera also has double the L2 cache at 2MB per core, a larger unified L3 cache at 164MB, and supports PCIe Gen 6 as well as CXL 3.1 connectivity. The Vera CPU as tested for this initial benchmarking had a 450 Watt socket TDP. With the LPDDR5X memory it's around 50 Watts or less of power consumption. NVIDIA Vera data center CPUs remain on track for shipping in the second half of the year, but ahead of the ramp, NVIDIA invited me to their Santa Clara headquarters to run some of the first public benchmarks of this new CPU with their Olympus cores. In this article are those very initial results. But before talking performance, it's first important to note the level of Linux support. With Vera not officially out there yet I really didn't know what to expect for the upstream Linux kernel support and the like or what their plans are among the major ARM64 Linux distributions, etc. Fortunately, NVIDIA Vera is already in good shape with upstream Linux kernel support. Linux 7.1+ has the key driver support in place and Vera should work on ARM64 server Linux distributions like Ubuntu, Fedora, etc. NVIDIA will also continue to provide Base OS as their modified version of Ubuntu with all the Vera patches ready. ACPI is relied upon and no needing to deal with frustrating Device Trees or other headaches for NVIDIA Vera on Linux. With Vera being compatible with Armv9.2 and complying with Arm's Server Base System Architecture (SBSA), it ends up making use of many of the common ARM Linux drivers for support, as part of the reason it's more difficult pre-launch to track the upstream kernel support status in advance of hardware availability. Among that common ARM Linux code being used is the ongoing work around Arm Confidential Compute (CCA) for confidential computing with VMs to be supported by Vera. Last year GCC and LLVM Clang added support for the Olympus cores. That means GCC 16.1+ or LLVM Clang 21+ for building optimized binaries for Vera. It was great and ambitious of NVIDIA introducing this Olympus compiler support so early and they can be applauded for that initiative. For comparison, while NVIDIA upstreamed their Olympus support in March of 2025, it was only in December when the AMD Zen 6 (znver6) support was added to GCC and this February when it appeared in LLVM/Clang. NVIDIA's Olympus compiler upstreaming is akin to the well-in-advance support we have been used to seeing from Intel over the years. Great seeing NVIDIA take similar initiative with their Vera upstreaming for compilers and at large. Hopefully this continues with future NVIDIA CPU generations. I wasn't able to test loading different Linux distributions on NVIDIA Vera or the like, but from everything told while at NVIDIA, the upstream open-source Linux support is in good shape already for Vera. My testing was done on Ubuntu 24.04 LTS with their base OS configuration of a patched Linux 6.18 LTS kernel paired with GCC 16.1. The NVIDIA Vera with its Olympus cores were working nicely on Linux. One caveat is some power management tuning still in the process of being upstreamed. Recently I covered on NVIDIA working on ACPI CPPC v4 support for Linux and this appears to be one of the related bits. Due to that power management tuning still happening, unfortunately, NVIDIA asked that CPU power consumption monitoring not be engaged during this initial round of benchmarking. Similarly, CPU frequency monitoring was not allowed either during this first round of testing. Additionally, the NVIDIA Vera benchmarking was done from one of their pre-production, open-platform system where as it will be more relevant for looking at power and frequency in an actual production, enclosed server chassis designs later in 2026. NVIDIA also requested only specific workloads relevant to the intended workloads/domains that Vera is catering to in the data center be tested. So this first round of Vera benchmarking isn't too comprehensive across the spectrum of possible workloads but limited to the benchmarks that were permitted based on what they feel were most relevant -- plus the fact I was only spending one day at NVIDIA's offices. For these initial NVIDIA Vera benchmarks they preferred the scope of benchmarks be limited to target use-cases they feel most relevant for their modern data center customers. This isn't a sponsored article but I obliged to their requests in order to run these initial Vera CPU benchmarks. Hopefully in future rounds of Vera testing over the months ahead there will be a more widespread set of benchmarks for those curious about the Olympus CPU core performance at large. Similarly, in being able to report on the power efficiency and performance-per-Watt once their power management code is tuned.
[2]
NVIDIA Vera CPU Is 'Packing a Heavy-Hitting Punch' Against Competition
In the new Phoronix benchmark, Vera delivers winning performance and memory results for agentic AI. The shift to agentic AI creates a new CPU requirement for the AI factory: fast cores, massive memory bandwidth and the ability to sustain high performance when all cores are active. Initial benchmark results published by Phoronix today show that the NVIDIA Vera CPU meets this need. For this first public look, the benchmark scope was centered on the agentic workloads Vera was designed for in the modern data center. The Vera CPU delivers the throughput AI factories need while optimizing platform power. Eighty-eight NVIDIA custom Olympus cores, 1.2TB/s of memory bandwidth and a high-speed, on-chip fabric results in a CPU platform that combines core performance and memory bandwidth in an efficient power envelope. NVIDIA Olympus Delivers Aggressive Performance At the heart of Vera are custom NVIDIA Olympus CPU cores. Fully compatible with the Armv9.2 instruction set architecture, Olympus is designed for the sequential CPU work underpinning agentic AI: branch-heavy runtimes, sandboxed code, data processing and orchestration. Vera's monolithic die, wide cores, advanced branch prediction and the second-generation NVIDIA Scalable Coherency Fabric help Vera keep data moving across all 88 cores. Phoronix's testing of a single-socket Vera CPU -- rated at 450-watt thermal design power with less than 30 watts of memory power -- showed that it delivers outstanding performance within that power profile, along with generational gains across a broad array of workloads spanning code compilation, file compression, video transcoding, Python, Java and database management. These are the same kinds of CPU-heavy tasks that agents and AI factories run every day: compiling code, executing runtimes, compressing data, querying databases and coordinating large software stacks. "Going into this, I didn't really know what to expect of NVIDIA's Vera with the new Olympus cores," wrote Michael Larabel, founder and principal author of Phoronix. "But in the end I was left realizing this is the most formidable competition to Intel and AMD x86_64 processors ever realized." 'Incredible Advantage' in Memory Performance Agentic workloads are not limited by core count alone. They need high core utilization and sustained memory bandwidth, making memory performance per watt a critical part of overall CPU efficiency. Vera incorporates a second-generation LPDDR5X memory subsystem, enabling dramatically lower energy per bit compared to DDR5. This allows Vera to offer up to a massive 1.2 TB/s of bandwidth -- up to 2x the peak memory bandwidth compared with traditional CPUs in less than 30 watts of memory power, as opposed to more than 100 watts for traditional DDR5. In Phoronix STREAM TRIAD testing, Vera sustained 90% of its peak memory bandwidth -- achieving the highest percentage of rated peak bandwidth of any CPU tested by Phoronix -- and delivered over 4x the memory bandwidth per core compared with traditional x86 CPUs. "NVIDIA Vera with its LPDDR5X memory was showing its incredible advantage in memory performance over current Intel Xeon and AMD EPYC processors," Larabel wrote. However, peak bandwidth is only part of the story. AI factory workloads run many sandboxes, tool calls and data services at the same time. In separate testing with Vera, Prime Intellect found that Vera maintained high bandwidth and low, consistent memory latency as more workloads ran in parallel -- the kind of predictable performance needed for agentic AI. A Large Generational Leap -- and Leadership in Phoronix Testing Compared with the prior-generation NVIDIA Grace CPU, Vera delivered a 1.6x geometric mean increase in Phoronix's testing -- an incredible generation-over-generation gain. "The difference from Grace to Vera was consistently exceeding my expectations for gen-on-gen performance we typically see for processors," Larabel wrote. "NVIDIA's Vera CPU with its in-house-designed Olympus CPU cores ends up packing a heavy-hitting punch with competitiveness to Intel/AMD x86_64 CPUs that I have never seen out of any other ARM or non-x86_64 processors." Vera led the tested CPU field, delivering a 1.5x overall performance advantage compared with a latest-generation 128-core x86 processor. The gains showed up in practical developer workloads. Single-socket Vera compiled a default Linux kernel in just 20 seconds, the fastest result Phoronix measured in that test. Vera delivered 2x faster Linux kernel compilation on a per-core basis compared with a 128-core processor. "On a [geometric] mean basis, the NVIDIA Vera delivered 10% better performance than the AMD EPYC 9575F 5.0 GHz high frequency processor," Larabel wrote. Vera in Customer Testing, Coming Soon From Partners At NVIDIA GTC, NVIDIA announced widespread ecosystem support for Vera, spanning AI natives, supercomputing centers, cloud service providers and infrastructure providers. NVIDIA has also delivered the first Vera CPUs to leading AI companies and cloud providers, marking an important milestone as Vera moves toward partner availability in the second half of the year. Vera will be available from partners in dual- and single-socket systems, with air-cooled and liquid-cooled options to support AI factory deployments, from standard enterprise data centers to high-density agentic AI infrastructure. Learn more about NVIDIA Vera.
[3]
NVIDIA's Vera CPU With 88 Olympus "Arm" Cores Outperforms AMD EPYC & Intel Xeon In First Benchmarks
The first NVIDIA Vera CPU benchmarks have been released, showcasing a huge gain over Grace while outperforming the latest AMD EPYC & Intel Xeon chips. NVIDIA Vera CPU Brings An Impressive 63% Performance Bump Over Grace CPU, Challenges x86 Chips From AMD & Intel NVIDIA recently announced that its Vera CPUs were in full production, hand-delivering the first CPU racks to major AI firms such as OpenAI, SpaceX, Anthropic, and Oracle. Vera is a fundamental component of the Extreme Co-Design ecosystem powering the Rubin platform, but with Vera, NVIDIA is entering into a new market for the first time: Standalone CPUs. The ARM-based CPU with 88 custom Olympus cores is all set to deliver some big upgrades, with NVIDIA promising 50% better performance, twice the performance per watt (efficiency), and four times the density per rack vs traditional x86 CPUs. It's a CPU that is purpose-built for Agentic AI and Inference domains. Now, the first benchmarks of NVIDIA's Vera CPU have been published by Phoronix, and they showcase a big leap in performance. In the Geomean of all test results, the NVIDIA Vera CPU with 88 Olympus cores ended up 63% faster than its predecessor, the 72-core Grace CPU. The CPU was also 10% faster than AMD's EPYC 9575F, which has a total of 64 "Zen 5" cores clocked at 5 GHz, while also beating the Intel Xeon 6980P, a 128-core chip based on the Granite Rapids family, by 55%. These benchmarks give us a first look at the strong performance that NVIDIA's Vera CPU has to offer, and it's more important than ever to look at NVIDIA's CPU designs since they will not only land in Vera Rubin AI systems, but will also be available in standalone models, with NVIDIA aiming to become the biggest CPU supplier in 2026. The demand for Vera is huge, and since Agentic AI loves CPUs, Vera is primed to eat a big share out of Intel and AMD server segments. At the same time, some important metrics, such as performance per watt (efficiency), are missing from the tests. Phoronix states that they were not permitted to run or showcase these tests. The Vera module they got was an early pre-production hardware, so it looks like we may see some pre-release power tuning & optimizations that will further help the system achieve better performance/efficiency. Phoronix's conclusion states that this is the "most performant ARM Linux server processor" they have ever tested. Looking ahead, NVIDIA Vera CPUs do face some heated competition in the form of AMD's next-gen EPYC Venice, based on the Zen 6 core architecture, which is already in mass production and primed for a 2H 2026 release. Intel is gearing up its Diamond Rapids platform while Qualcomm and Arm are also working on their own data-center-specific CPUs for the Agentic AI race. Follow Wccftech on Google to get more of our news coverage in your feeds.
Share
Copy Link
NVIDIA's Vera CPU with custom Olympus cores has posted impressive benchmark results, showing a 63% performance boost over its predecessor Grace and beating AMD's EPYC 9575F by 10%. The ARM-based CPU features 88 cores, 1.2TB/s memory bandwidth, and is purpose-built for agentic AI workloads. First public testing shows it delivering the strongest ARM server performance ever measured.
NVIDIA has achieved a significant milestone with its Vera data center CPU, as first public NVIDIA Vera CPU benchmarks reveal performance levels that challenge the dominance of traditional x86_64 CPUs. The ARM-based CPU, featuring 88 custom NVIDIA Olympus cores, delivered a 63% performance improvement over the previous-generation Grace CPU in testing conducted by Phoronix
1
. Michael Larabel, founder of Phoronix, stated this is "the most formidable competition to Intel and AMD x86_64 processors ever realized"2
. The NVIDIA Vera CPU is designed specifically for agentic AI workloads and modern data center operations, representing NVIDIA's ambitious entry into the standalone CPU market.
Source: Wccftech
The Olympus cores at the heart of Vera demonstrated exceptional capabilities across diverse workloads. In geometric mean calculations across all tests, the NVIDIA Vera CPU outperforms AMD EPYC 9575F—a 64-core Zen 5 processor running at 5 GHz—by 10%
2
. More impressively, it beat Intel Xeon 6980P, a 128-core Granite Rapids chip, by 55%3
. The single-socket Vera compiled a default Linux kernel in just 20 seconds, the fastest result Phoronix has measured in that test, delivering 2x faster Linux kernel compilation on a per-core basis compared with a 128-core processor2
. These results span practical developer workloads including code compilation, file compression, video transcoding, Python, Java, and database management—the same CPU-heavy tasks that agents and AI factories run daily.Beyond raw processing power, the NVIDIA Vera CPU demonstrates exceptional memory performance critical for agentic AI workloads. Equipped with LPDDR5X memory, Vera achieves up to 1.2TB/s of memory bandwidth while consuming less than 30 watts of memory power, compared to over 100 watts for traditional DDR5 systems
2
. In STREAM TRIAD testing, Vera sustained 90% of its peak memory bandwidth—the highest percentage of rated peak bandwidth of any CPU tested by Phoronix—and delivered over 4x the memory bandwidth per core compared with traditional x86 CPUs2
. This memory advantage proves essential for inference domains where multiple sandboxes, tool calls, and data services run simultaneously. The 88 Olympus cores are compatible with Armv9.2 ISA, support FP8 precision, deliver 176 threads via spatial multi-threading, and feature double the L2 cache at 2MB per core compared to Grace, plus a larger unified L3 cache at 164MB1
.Source: Phoronix
Related Stories
NVIDIA has taken an aggressive approach to ecosystem readiness. Linux 6.11+ includes key driver support, and the pre-production unit tested ran Ubuntu 24.04 LTS with a patched Linux 6.18 LTS kernel paired with GCC 16.1
1
. GCC 16.1+ and LLVM Clang 21+ now support building optimized binaries for Olympus cores, with compiler support upstreamed in March 2025—months before the CPU's second-half 2025 launch1
. This mirrors Intel's historical approach to early compiler integration. NVIDIA has already hand-delivered first CPU racks to OpenAI, SpaceX, Anthropic, and Oracle3
. The company aims to become the biggest CPU supplier in 2026, targeting a substantial share of Intel and AMD server segments as demand for agentic AI accelerates.
Source: NVIDIA
While performance results impress, complete power efficiency data remains unavailable. Testing occurred with a 450-watt socket TDP on a pre-production, open-platform system, and NVIDIA requested that CPU power consumption and frequency monitoring not be engaged during this initial benchmarking round
1
. Some power management tuning related to ACPI CPPC v4 support still requires upstreaming1
. Production systems in enclosed server chassis designs later in 2026 will provide clearer efficiency metrics. Looking ahead, Vera faces heated competition from AMD's next-gen EPYC Venice based on Zen 6 core architecture, already in mass production for 2H 2026 release, plus Intel's Diamond Rapids platform and offerings from Qualcomm and Arm targeting the agentic AI race3
. The widespread ecosystem support announced at NVIDIA GTC, spanning AI natives, supercomputing centers, cloud service providers, and infrastructure providers, positions Vera to capture significant market share as AI factories prioritize CPUs capable of sustaining high performance when all cores are active.Summarized by
Navi
[1]
26 Jan 2026•Technology
19 May 2026•Technology

25 Feb 2026•Technology

1
Technology

2
Policy and Regulation

3
Science and Research
