Apple and NVIDIA Collaborate on ReDrafter Technique to Boost LLM Performance

3 Sources

Apple and NVIDIA have joined forces to integrate the ReDrafter technique into NVIDIA's TensorRT-LLM framework, significantly improving the speed and efficiency of large language models.


Apple and NVIDIA Join Forces to Enhance LLM Performance

In a surprising collaboration, tech giants Apple and NVIDIA have partnered to improve the performance of large language models (LLMs). The focus of this partnership is the integration of Apple's Recurrent Drafter (ReDrafter) technique with NVIDIA's TensorRT-LLM framework, aiming to significantly boost text generation speeds in AI models [1][2].

Understanding ReDrafter

ReDrafter, a technique open-sourced by Apple earlier this year, combines two approaches to enhance LLM performance:

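  1. Beam search: A search strategy that keeps several candidate continuations alive at each step instead of committing to a single one.
  2. Dynamic tree attention: A mechanism that merges the prefixes shared by those candidates into a tree, so the attention computation handles each shared prefix once rather than once per candidate.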

This approach can speed up LLM token generation to as many as 3.5 tokens per generation step, where conventional autoregressive decoding produces one token per step [2].
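To make the draft-and-verify idea concrete, here is a minimal Python sketch. It is not Apple's or NVIDIA's implementation: the toy "target model" is a fixed sentence, the candidate drafts are hand-written, and every name (`count_tree_nodes`, `longest_accepted`, `toy_target_next`) is hypothetical. It shows two things: how merging shared prefixes shrinks the number of positions to verify, and how more than one drafted token can be accepted in a single step.

```python
from typing import Callable, List, Sequence, Tuple

Token = str


def count_tree_nodes(candidates: Sequence[Sequence[Token]]) -> int:
    """Count distinct prefixes across all candidates.

    Each distinct prefix is one node of the prefix tree, so this is the
    number of positions left to verify once shared prefixes are merged.
    """
    prefixes = set()
    for cand in candidates:
        for i in range(1, len(cand) + 1):
            prefixes.add(tuple(cand[:i]))
    return len(prefixes)


def longest_accepted(
    candidates: Sequence[Sequence[Token]],
    target_next: Callable[[Tuple[Token, ...]], Token],
    context: Sequence[Token],
) -> List[Token]:
    """Return the longest candidate prefix the target model agrees with.

    Assumption: greedy verification, where a drafted token is accepted only
    if it equals the target model's own next token. A real system would check
    all candidates in one batched forward pass; this toy loops for clarity.
    """
    best: List[Token] = []
    for cand in candidates:
        accepted: List[Token] = []
        for tok in cand:
            if target_next(tuple(context) + tuple(accepted)) != tok:
                break
            accepted.append(tok)
        if len(accepted) > len(best):
            best = accepted
    return best


# Hypothetical stand-in for the large model: it always continues this sentence.
SENTENCE = ["the", "cat", "sat", "on", "the", "mat"]


def toy_target_next(prefix: Tuple[Token, ...]) -> Token:
    return SENTENCE[len(prefix)] if len(prefix) < len(SENTENCE) else "<eos>"


context = ["the"]
# Hand-written drafts standing in for beam-search output from a draft model.
candidates = [
    ["cat", "sat", "down"],
    ["cat", "sat", "on"],
    ["dog", "sat", "on"],
]

flat_positions = sum(len(c) for c in candidates)
tree_positions = count_tree_nodes(candidates)
accepted = longest_accepted(candidates, toy_target_next, context)

print(f"positions without prefix sharing: {flat_positions}")
print(f"positions with prefix sharing:    {tree_positions}")
print(f"accepted in one step:             {accepted}")
```

In this toy run, three drafted tokens are accepted in one verification step, and the two candidates that begin with "cat sat" share those positions instead of repeating them.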

Integration with NVIDIA's TensorRT-LLM

To make ReDrafter production-ready for NVIDIA GPUs, the two companies collaborated to integrate it into the NVIDIA TensorRT-LLM inference acceleration framework. This integration required NVIDIA to add new operators and expose existing ones, significantly improving TensorRT-LLM's capability to accommodate sophisticated models and decoding methods [1].

Impressive Performance Gains

The collaboration has yielded remarkable results:

  • A 2.7x speed-up in generated tokens per second for greedy decoding when benchmarking a tens-of-billions-parameter production model on NVIDIA GPUs [1][2].
  • Potential for significant reduction in latency, GPU usage, and power consumption [1][2].
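As a rough illustration of what a 2.7x throughput gain implies, the short Python sketch below converts the speed-up into time per generated token. It assumes the same hardware and batch size and that token generation dominates the response time. The resulting percentage is derived arithmetic, not a figure reported by Apple or NVIDIA, and the function name is hypothetical.

```python
def time_fraction(speedup: float) -> float:
    """Fraction of the baseline time per generated token that remains
    after a given tokens-per-second speed-up."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return 1.0 / speedup


reported_speedup = 2.7  # tokens-per-second speed-up quoted in the article

remaining = time_fraction(reported_speedup)
reduction = 1.0 - remaining

print(f"time per token: {remaining:.0%} of baseline")
print(f"reduction:      {reduction:.0%}")
```

Under those assumptions, each generated token takes a little over a third of its baseline time.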

Implications for AI Development

This technological advancement could have far-reaching effects on AI development and application:

  1. Reduced computational costs
  2. Improved user experience through lower latency in production applications
  3. Enhanced efficiency in AI model processing

Machine learning developers using NVIDIA GPUs can now easily benefit from ReDrafter's accelerated token generation for their production LLM applications with TensorRT-LLM [1].

A Unique Partnership

While this collaboration shows that Apple and NVIDIA can work together, it appears to be a short-term partnership focused on a specific technical goal. Given the companies' history, a long-term business relationship seems unlikely [1][3].

Market Impact

Both Apple and NVIDIA are major players in the tech industry:

  • Apple reported Q4 revenue of $94.9 billion, surpassing analyst expectations [3].
  • NVIDIA's Q3 revenue reached $35.1 billion, marking a 94% increase compared to the previous year [3].

Together, these tech giants are valued at approximately $7 trillion, with Apple being the most valuable company globally and NVIDIA ranking third [3].

This collaboration between two industry leaders highlights the ongoing race to improve AI technologies and could reshape how AI models are developed and deployed.
