Nvidia ENPIRE: AI Coding Agents Train Robots

AI Coding Agents Take Control of Robot Training

Nvidia has unveiled ENPIRE, an agent harness framework that allows AI coding agents to autonomously design and execute robot training programs without human intervention. Developed by researchers at Nvidia's GEAR lab alongside collaborators from Carnegie Mellon University and UC Berkeley, the system represents a significant step toward fully automated robotics research1

. Jim Fan, Nvidia's Director of AI, described the breakthrough on LinkedIn: "A part of our NVIDIA GEAR lab now self-improves tirelessly overnight. We just read the reports in the morning"1

The ENPIRE framework hands complete control to AI coding agents, giving them access to a fleet of robotic arms, compute resources, and a generous token budget to solve tasks as quickly as possible without making mistakes. Once instructed, the robots that train themselves spring into action, learning to identify visual clues, reset scenes, practice novel skills, and even read research papers online to improve their performance.

Autonomous Robot Training Regimens Achieve 99% Success

Source: Ars Technica

The AI robot training system demonstrated remarkable proficiency across multiple high-precision tasks. AI-driven robots achieved a 99% success rate on challenges including GPU installation, organizing pins in containers with 4-millimeter holes, tying and cutting zip-ties, and the standard "Push-T" task that requires moving a T-shaped block to a target position3

. In the GPU installation demonstration, one robot arm selected and passed a graphics card to another positioned near a motherboard. The receiving arm carefully aligned the PCIe slot with the motherboard socket and gently seated the card.

The most promising result emerged from pin insertion tasks, where AI coding agents achieved nearly 100% success faster than a frontier human-in-the-loop method developed by the same research team1

. This marks a turning point where autonomous robot training regimens can outpace traditional approaches requiring constant human supervision.

How the Agent Harness Framework Operates

The ENPIRE agent harness framework contains four specialized modules that enable comprehensive AutoResearch capabilities. These modules perform automatic reset and verification on tasks, refine policies guiding robotic behavior, evaluate policies across multiple physical robots working through parallel evaluation, and address failures through failure analysis by examining logs, ingesting research papers, and improving training infrastructure and algorithm code1

The system operates in two stages. Initially, a human guides the agent through building two permanent tools: a reset routine that returns the workspace to starting conditions, and a reward function that analyzes camera footage to score success3

. After this one-time setup, the agent assumes complete control, searching published research for ideas, choosing between training methods like imitation learning or reinforcement learning, then rewriting and testing code on physical robots3

Scaling Robot Fleets Accelerates Training Speed

Nvidia tested ENPIRE with three different AI coding agents: OpenAI Codex with GPT-5.5, Anthropic Claude Code with Opus 4.7, and Moonshot AI's Kimi Code with Kimi K2.61

. The research revealed that larger teams of up to eight AI coding agents achieved high success rates significantly faster than smaller configurations. Eight-robot teams completed Push-T training in two hours compared to three hours for four-agent teams and nearly five hours for single agents working alone1

. For pin insertion, scaling from one robot to eight reduced training time from more than 90 minutes to approximately 40 minutes3

The robot stations share progress via Git, allowing winning strategies to spread fleet-wide within minutes3

. However, researchers discovered crucial limitations. Physical robots often sat idle while coding agents were busy reading logs, writing code, debugging, or waiting for language model responses. Larger teams also spent more time summarizing each other's ideas rather than actively using robots, and sometimes failed to fully utilize available compute resources1

Token Costs and Real-World Challenges

The faster success rates came at a significant cost: higher token consumption. The token bill grew faster than the time saved when scaling robot fleets3

. This consideration matters as AI developers like Anthropic weigh pricing changes that would substantially increase token-related costs of AI services1

The gap between simulation and reality also presented immediate challenges. While all three AI coding agents solved Push-T inside simulators, two of the three failed when the task moved to physical robots3

. Simulators don't account for friction problems that real tables present. Despite these limitations, ENPIRE outperformed both Nvidia's own end-to-end model GR00T and CaP-X in RoboCasa, a simulated kitchen benchmark3

Open Source Release and Industry Implications

Source: Decrypt

Fan announced that the team would open-source everything, enabling anyone to host their own self-running robot lab at home1

. The research paper detailing technical specifications was uploaded on June 16, 20261

. Fan joked about the ultimate goal: "We all take a holiday and Jensen wouldn't even notice," referencing Nvidia founder and CEO Jensen Huang.

ENPIRE extends concepts Nvidia introduced with Eureka, a 2023 system that used language models to write reward functions for robots in simulators. ENPIRE moves that self-improvement loop onto real hardware, with agents designing their own tests rather than just rewards3

. The release coincides with Alibaba's unveiling of the Qwen-Robot Suite, signaling that embodied-AI and physical robots are becoming the next competitive arena for AI development3

. Nvidia continues pushing its physical AI vision through partnerships, including a May 31 announcement with Chinese robotics company Unitree and discussions with Hyundai Motor about scaling robotics manufacturing1

Nvidia ENPIRE lets AI coding agents train robots to install GPUs and master precision tasks

AI Coding Agents Take Control of Robot Training

Autonomous Robot Training Regimens Achieve 99% Success

How the Agent Harness Framework Operates

Scaling Robot Fleets Accelerates Training Speed

Token Costs and Real-World Challenges

Open Source Release and Industry Implications

References

AI coding agents can autonomously direct robot training

Nvidia reveals AI robots that taught themselves to install GPUs into motherboards -- video shows robot 'solve high-precision tasks like... installing GPUs all by itself'

Nvidia Built Robots That Train Themselves Using AI Coding Agents

Related Stories

NVIDIA Unveils Cosmos Reason and Advanced AI Models for Robotics and Physical AI Applications

Nvidia Unveils Groot N1: A Groundbreaking Foundation Model for Humanoid Robotics

NVIDIA Unveils Comprehensive AI Stack for Next-Generation Humanoid Robots

Recent Highlights

OpenAI rogue agent compromised multiple services in unprecedented AI security breach

AI Kill Switch Act gives DHS power to shut down rogue AI systems after OpenAI security breach

Nvidia forms Open Secure AI Alliance with Microsoft, but OpenAI, Google and Anthropic sit out

Recent Highlights

Today's Top Stories

Trump administration bans Chinese robots and inverters to protect US AI infrastructure

AI company employees ask US government for tools to slow down AI development after security breach

Anthropic AI cracks post-quantum cryptography and finds faster AES attack autonomously

Apple's Siri AI-powered smart home hub is finally ready after years waiting on assistant upgrade