NVIDIA and Stability AI Optimize Stable Diffusion 3.5 for RTX GPUs, Reducing VRAM Usage by 40%

Reviewed byNidhi Govil

2 Sources

NVIDIA collaborates with Stability AI to optimize Stable Diffusion 3.5, reducing VRAM requirements and doubling performance on RTX GPUs. This breakthrough enables more widespread use of the powerful AI image generation model.

NVIDIA and Stability AI Collaborate on Stable Diffusion 3.5 Optimization

NVIDIA has partnered with Stability AI to significantly enhance the performance and accessibility of Stable Diffusion 3.5, one of the world's most popular AI image generation models. This collaboration has resulted in substantial improvements in both efficiency and VRAM usage, making the model more accessible to a wider range of NVIDIA RTX GPUs 12.

Reducing VRAM Requirements

Source: TweakTown

Source: TweakTown

The base Stable Diffusion 3.5 Large model initially required over 18GB of VRAM, limiting its use to high-end GPUs. Through quantization techniques, NVIDIA and Stability AI have managed to reduce the VRAM requirement by 40%, bringing it down to 11GB 1. This optimization allows the model to run on five GeForce RTX 50 Series GPUs instead of just one, significantly expanding its potential user base 2.

Performance Enhancements with TensorRT

In addition to VRAM reduction, NVIDIA has applied its TensorRT software development kit (SDK) to optimize Stable Diffusion 3.5 Large and Medium models. These optimizations take full advantage of the Tensor Cores in RTX GPUs, resulting in impressive performance gains 1:

  1. SD3.5 Large: 2.3x performance boost compared to BF16 PyTorch
  2. SD3.5 Medium: 1.7x performance increase over BF16 PyTorch

TensorRT for RTX: Reimagined for AI PCs

NVIDIA has also introduced a new version of TensorRT specifically designed for RTX AI PCs. This updated SDK offers several key improvements 1:

  1. Just-in-time (JIT) on-device engine building
  2. 8x smaller package size
  3. Seamless AI deployment to over 100 million RTX AI PCs

The new TensorRT for RTX is now available as a standalone SDK for developers, allowing for easier integration and optimization of AI models on RTX hardware 1.

Implications for AI Development and Deployment

These advancements have significant implications for AI developers and end-users:

  1. Wider accessibility: More GPUs can now run complex AI models like Stable Diffusion 3.5
  2. Improved efficiency: Reduced VRAM usage and increased performance allow for faster and more resource-efficient AI operations
  3. Easier deployment: The new TensorRT for RTX SDK simplifies the process of optimizing and deploying AI models on RTX hardware

Future Developments

NVIDIA and Stability AI are planning to release Stable Diffusion 3.5 as an NVIDIA NIM microservice in July, further simplifying access and deployment for creators and developers across various applications 1.

As AI models continue to grow in complexity and capability, optimizations like these will play a crucial role in making advanced AI tools more accessible to a broader range of users and devices. With NVIDIA's announcement of over 100 million RTX AI PCs worldwide, the potential impact of these improvements is substantial 2.

Explore today's top stories

Google Unveils Pixel 10 Series: AI-Powered Features and Camera Upgrades Take Center Stage

Google has launched its new Pixel 10 series, featuring improved AI capabilities, camera upgrades, and the new Tensor G5 chip. The lineup includes the Pixel 10, Pixel 10 Pro, and Pixel 10 Pro XL, with prices starting at $799.

Ars Technica logoTechCrunch logoCNET logo

60 Sources

Technology

9 hrs ago

Google Unveils Pixel 10 Series: AI-Powered Features and

Google Unveils AI-Powered Pixel 10 Smartphones with Advanced Gemini Features

Google launches its new Pixel 10 smartphone series, showcasing advanced AI capabilities powered by Gemini, aiming to compete with Apple in the premium handset market.

Bloomberg Business logoThe Register logoReuters logo

22 Sources

Technology

9 hrs ago

Google Unveils AI-Powered Pixel 10 Smartphones with

NASA and IBM Unveil Surya: An AI Model to Predict Solar Flares and Space Weather

NASA and IBM have developed Surya, an open-source AI model that can predict solar flares and space weather with improved accuracy, potentially helping to protect Earth's infrastructure from solar storm damage.

New Scientist logoengadget logoGizmodo logo

6 Sources

Technology

17 hrs ago

NASA and IBM Unveil Surya: An AI Model to Predict Solar

Google Unveils Pixel Watch 4: A Leap Forward in AI-Powered Wearables

Google's latest smartwatch, the Pixel Watch 4, introduces significant upgrades including a curved display, AI-powered features, and satellite communication capabilities, positioning it as a strong competitor in the smartwatch market.

TechCrunch logoCNET logoZDNet logo

18 Sources

Technology

9 hrs ago

Google Unveils Pixel Watch 4: A Leap Forward in AI-Powered

FieldAI Secures $405M Funding to Revolutionize Robot Intelligence with Physics-Based AI Models

FieldAI, a robotics startup, has raised $405 million to develop "foundational embodied AI models" for various robot types. The company's innovative approach integrates physics principles into AI, enabling safer and more adaptable robot operations across diverse environments.

TechCrunch logoReuters logoGeekWire logo

7 Sources

Technology

9 hrs ago

FieldAI Secures $405M Funding to Revolutionize Robot
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo