2 Sources
[1]
NVIDIA, Ineffable Intelligence Team Up to Build the Future of Reinforcement Learning Infrastructure
Together, we are building the reinforcement learning infrastructure that unlocks new levels of intelligence. Reinforcement-learning agents -- AI systems that learn by trial and error -- can convert computation into new knowledge. That's the focus of a new engineering-level collaboration between NVIDIA and Ineffable Intelligence, the London-based AI lab founded by AlphaGo architect David Silver in the wake of Ineffable's emergence from stealth last week. "The next frontier of AI is superlearners -- systems that learn continuously from experience," said Jensen Huang, founder and CEO of NVIDIA. "We are thrilled to partner with Ineffable Intelligence to codesign the infrastructure for large-scale reinforcement learning as they push the frontier of AI and pioneer a new generation of intelligent systems." Silver is one of the pioneers of reinforcement learning, an approach that has transformed AI research. He's focused on further developing this approach into a new paradigm. "Researchers have largely solved the easier problem of AI: how to build systems that know all the things humans already know," Silver said. "But now we need to solve the harder problem of AI: how to build systems that discover new knowledge for themselves. That requires a very different approach -- systems that learn from experience." That kind of learning needs a powerful and highly optimized pipeline to support it. Unlike pretraining, where a fixed dataset of human data flows through the system, reinforcement learning workloads generate their data on the fly. The system has to act, observe, score and update continuously in tight loops, which puts pressure on interconnect, memory bandwidth and serving in ways that pretraining doesn't. Furthermore, the system will train on rich forms of experience that are quite distinct from human language and other human data, and may require novel model architectures and training algorithms. That's where NVIDIA and Ineffable are focusing their technical work: building a pipeline that can feed reinforcement learning systems at scale. Engineers from both companies have teamed up to explore the best way to create this training pipeline. This work is starting on NVIDIA Grace Blackwell, and will be among the first to explore the upcoming NVIDIA Vera Rubin platform. The goal is to understand the next generation of hardware and software that will be required as the AI world shifts beyond human data toward models that learn through simulation and experience. Getting this infrastructure right will unlock an unprecedented scale of reinforcement learning in highly complex and rich environments, allowing agents to discover breakthroughs across all fields of knowledge.
[2]
Nvidia Announces Startup Partnership to Build Reinforcement-Trained AI
Nvidia said it is working with the artificial intelligence startup Ineffable Intelligence to build more reinforcement-learning agents. The chipmaker wants to build more of the AI agents, which learn by trial and error, to convert computation into new knowledge, it said Wednesday. Ineffable Intelligence is a London-based startup that emerged from stealth-mode development last week and was founded by AlphaGo architect David Silver. The startup will codesign the infrastructure for large-scale reinforcement learning. This form of learning allows AI to discover new knowledge itself, rather than solely learning from what humans already know. Reinforcement learning puts more strain on interconnect and requires more memory bandwidth, Nvidia said. It could also require new model architectures and training algorithms. Engineers from both companies will be working on figuring out the best way to support that type of AI training. The work is starting on Nvidia's Grace Blackwell program and will be among the first to use its upcoming Vera Rubin platform. Write to Katherine Hamilton at [email protected]
Share
Copy Link
NVIDIA has announced an engineering-level collaboration with Ineffable Intelligence, the London-based AI startup founded by AlphaGo architect David Silver. Together, they're building reinforcement learning infrastructure that enables AI systems to discover new knowledge through trial and error, rather than just learning from existing human data. The partnership will leverage NVIDIA's Grace Blackwell platform and upcoming Vera Rubin system.
NVIDIA has formed a strategic partnership with Ineffable Intelligence, marking a shift in how AI systems will be trained in the future. The collaboration brings together the chipmaker's hardware expertise with the vision of David Silver, the AlphaGo architect who founded the London-based AI startup that emerged from stealth mode last week
1
. At the heart of this partnership lies a fundamental challenge: building reinforcement learning infrastructure capable of supporting AI that can discover new knowledge rather than merely processing what humans already know.
Source: NVIDIA
The partnership focuses on developing reinforcement-trained AI agents that learn through trial and error, converting raw computation into novel insights. "The next frontier of AI is superlearners -- systems that learn continuously from experience," said Jensen Huang, founder and CEO of NVIDIA
1
. Silver emphasized the paradigm shift this represents: "Researchers have largely solved the easier problem of AI: how to build systems that know all the things humans already know. But now we need to solve the harder problem of AI: how to build systems that discover new knowledge for themselves"1
.This approach demands AI learning through simulation and experience, a fundamentally different process than traditional pretraining methods. Unlike conventional training where fixed datasets flow through systems, large-scale reinforcement learning workloads generate data dynamically. The system must act, observe, score, and update continuously in tight loops
1
.The technical complexity of reinforcement learning creates unique computational demands that existing infrastructure wasn't designed to handle. The continuous feedback loops place intense pressure on interconnect, memory bandwidth, and serving capabilities in ways that pretraining doesn't
1
. NVIDIA noted that reinforcement learning puts more strain on interconnect and requires more memory bandwidth2
. The systems will also train on rich forms of experience distinct from human language and other human data, potentially requiring novel model architectures and training algorithms1
.Engineers from both companies have teamed up to explore optimal approaches for creating this training pipeline. The work is starting on NVIDIA Grace Blackwell and will be among the first to explore the upcoming NVIDIA Vera Rubin platform
1
2
. This collaboration aims to understand the hardware and software needs required as the AI industry shifts beyond human data toward models that learn through autonomous exploration.Related Stories
The partnership signals a critical inflection point for the AI industry. Getting this infrastructure right will unlock unprecedented scale of reinforcement learning in highly complex and rich environments, allowing agents to discover breakthroughs across all fields of knowledge
1
. The AI startup Ineffable Intelligence will codesign the infrastructure for large-scale reinforcement learning, enabling AI to discover new knowledge itself rather than solely learning from what humans already know2
. This collaboration between NVIDIA and Ineffable Intelligence represents a bet that the next wave of AI breakthroughs will come not from larger datasets of human knowledge, but from systems that can learn and discover on their own.Summarized by
Navi
[1]
[2]
28 Feb 2026•Technology

22 Apr 2026•Technology

06 May 2025•Technology

1
Technology

2
Technology

3
Technology
