Nvidia ENPIRE lets AI coding agents train robots to install GPUs and master precision tasks

Reviewed byNidhi Govil

3 Sources

Share

Nvidia's GEAR lab has developed ENPIRE, an agent harness framework that enables AI coding agents to autonomously direct robot training without human supervision. The system achieved 99% success rates on tasks like GPU installation and pin insertion, with eight-robot teams completing training in just two hours.

AI Coding Agents Take Control of Robot Training

Nvidia has unveiled ENPIRE, an agent harness framework that allows AI coding agents to autonomously design and execute robot training programs without human intervention. Developed by researchers at Nvidia's GEAR lab alongside collaborators from Carnegie Mellon University and UC Berkeley, the system represents a significant step toward fully automated robotics research

1

. Jim Fan, Nvidia's Director of AI, described the breakthrough on LinkedIn: "A part of our NVIDIA GEAR lab now self-improves tirelessly overnight. We just read the reports in the morning"

1

.

The ENPIRE framework hands complete control to AI coding agents, giving them access to a fleet of robotic arms, compute resources, and a generous token budget to solve tasks as quickly as possible without making mistakes. Once instructed, the robots that train themselves spring into action, learning to identify visual clues, reset scenes, practice novel skills, and even read research papers online to improve their performance.

Autonomous Robot Training Regimens Achieve 99% Success

Source: Ars Technica

Source: Ars Technica

The AI robot training system demonstrated remarkable proficiency across multiple high-precision tasks. AI-driven robots achieved a 99% success rate on challenges including GPU installation, organizing pins in containers with 4-millimeter holes, tying and cutting zip-ties, and the standard "Push-T" task that requires moving a T-shaped block to a target position

3

. In the GPU installation demonstration, one robot arm selected and passed a graphics card to another positioned near a motherboard. The receiving arm carefully aligned the PCIe slot with the motherboard socket and gently seated the card.

The most promising result emerged from pin insertion tasks, where AI coding agents achieved nearly 100% success faster than a frontier human-in-the-loop method developed by the same research team

1

. This marks a turning point where autonomous robot training regimens can outpace traditional approaches requiring constant human supervision.

How the Agent Harness Framework Operates

The ENPIRE agent harness framework contains four specialized modules that enable comprehensive AutoResearch capabilities. These modules perform automatic reset and verification on tasks, refine policies guiding robotic behavior, evaluate policies across multiple physical robots working through parallel evaluation, and address failures through failure analysis by examining logs, ingesting research papers, and improving training infrastructure and algorithm code

1

.

The system operates in two stages. Initially, a human guides the agent through building two permanent tools: a reset routine that returns the workspace to starting conditions, and a reward function that analyzes camera footage to score success

3

. After this one-time setup, the agent assumes complete control, searching published research for ideas, choosing between training methods like imitation learning or reinforcement learning, then rewriting and testing code on physical robots

3

.

Scaling Robot Fleets Accelerates Training Speed

Nvidia tested ENPIRE with three different AI coding agents: OpenAI Codex with GPT-5.5, Anthropic Claude Code with Opus 4.7, and Moonshot AI's Kimi Code with Kimi K2.6

1

. The research revealed that larger teams of up to eight AI coding agents achieved high success rates significantly faster than smaller configurations. Eight-robot teams completed Push-T training in two hours compared to three hours for four-agent teams and nearly five hours for single agents working alone

1

. For pin insertion, scaling from one robot to eight reduced training time from more than 90 minutes to approximately 40 minutes

3

.

The robot stations share progress via Git, allowing winning strategies to spread fleet-wide within minutes

3

. However, researchers discovered crucial limitations. Physical robots often sat idle while coding agents were busy reading logs, writing code, debugging, or waiting for language model responses. Larger teams also spent more time summarizing each other's ideas rather than actively using robots, and sometimes failed to fully utilize available compute resources

1

.

Token Costs and Real-World Challenges

The faster success rates came at a significant cost: higher token consumption. The token bill grew faster than the time saved when scaling robot fleets

3

. This consideration matters as AI developers like Anthropic weigh pricing changes that would substantially increase token-related costs of AI services

1

.

The gap between simulation and reality also presented immediate challenges. While all three AI coding agents solved Push-T inside simulators, two of the three failed when the task moved to physical robots

3

. Simulators don't account for friction problems that real tables present. Despite these limitations, ENPIRE outperformed both Nvidia's own end-to-end model GR00T and CaP-X in RoboCasa, a simulated kitchen benchmark

3

.

Open Source Release and Industry Implications

Source: Decrypt

Source: Decrypt

Fan announced that the team would open-source everything, enabling anyone to host their own self-running robot lab at home

1

. The research paper detailing technical specifications was uploaded on June 16, 2026

1

. Fan joked about the ultimate goal: "We all take a holiday and Jensen wouldn't even notice," referencing Nvidia founder and CEO Jensen Huang.

ENPIRE extends concepts Nvidia introduced with Eureka, a 2023 system that used language models to write reward functions for robots in simulators. ENPIRE moves that self-improvement loop onto real hardware, with agents designing their own tests rather than just rewards

3

. The release coincides with Alibaba's unveiling of the Qwen-Robot Suite, signaling that embodied-AI and physical robots are becoming the next competitive arena for AI development

3

. Nvidia continues pushing its physical AI vision through partnerships, including a May 31 announcement with Chinese robotics company Unitree and discussions with Hyundai Motor about scaling robotics manufacturing

1

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved