Apple Pioneers New Training Method for Humanoid Robots Using Vision Pro and Human Demonstrations

Reviewed byNidhi Govil

2 Sources

Apple researchers have developed a novel approach to training humanoid robots by combining human demonstrations captured through Apple Vision Pro with traditional robot data, potentially revolutionizing the field of robotics.

Apple's Innovative Approach to Robot Training

In a groundbreaking study titled "Humanoid Policy ~ Human Policy," Apple researchers have introduced a novel method for training humanoid robots that could revolutionize the field of robotics 12. The research, conducted in collaboration with MIT, Carnegie Mellon, the University of Washington, and UC San Diego, explores the use of first-person footage of human demonstrations to train general-purpose robot models.

The PH2D Dataset and HAT Model

Source: 9to5Mac

Source: 9to5Mac

At the heart of this innovation is the Physical Human-Humanoid Data (PH2D) dataset, comprising over 25,000 human demonstrations and 1,500 robot demonstrations 1. This data is processed by a unified AI policy called the Human-humanoid Action Transformer (HAT), which can control a real humanoid robot in the physical world 2.

The HAT model is designed to learn a single policy that generalizes across both human and robot bodies, making the system more flexible and data-efficient. This shared training approach has shown promising results, enabling robots to handle more challenging tasks, including ones they hadn't encountered before 1.

Leveraging Apple Vision Pro for Data Collection

To collect the training data, the team developed an innovative application for the Apple Vision Pro 1. The app captures video from the device's bottom-left camera and utilizes Apple's ARKit to track 3D head and hand motion 2. This setup allows for high-quality demonstrations to be recorded in seconds, a significant improvement over traditional robot tele-operation methods.

Cost-Effective Alternatives

Recognizing the need for more affordable solutions, the researchers also explored using modified consumer products. They 3D-printed a mount to attach a ZED Mini Stereo camera to other headsets, such as the Meta Quest 3, offering similar 3D motion tracking capabilities at a lower cost 12.

Overcoming Human-Robot Speed Differences

An interesting challenge the researchers faced was the speed disparity between human and robot movements. To address this, they slowed down the human demonstrations by a factor of four during training, allowing the robot to keep pace without requiring further adjustments 1.

Improved Performance and Generalization

The study suggests that this combined training strategy offers significant benefits. Robots trained using this approach demonstrated better results in select tasks, such as vertical object grasping, compared to those trained exclusively with robot demonstrators 2.

Future Implications

Source: AppleInsider

Source: AppleInsider

While Apple has only publicly demonstrated a robot-lamp prototype so far, rumors suggest the company is working on a mobile robot for consumers that could perform household chores and simple tasks 2. This research could pave the way for more advanced and versatile humanoid robots in the future.

Conclusion

Apple's research represents a significant step forward in robotics training, potentially making the development of humanoid robots more scalable and cost-effective. By combining human demonstrations with traditional robot data, this approach could accelerate progress in the field and bring us closer to the reality of general-purpose humanoid robots in our daily lives.

Explore today's top stories

Google Unveils Pixel 10 Series: AI-Powered Smartphones with Enhanced Features and Capabilities

Google launches its new Pixel 10 series, featuring improved AI capabilities, enhanced camera systems, and the new Tensor G5 chip. The lineup includes the base Pixel 10, Pixel 10 Pro, Pixel 10 Pro XL, and Pixel 10 Pro Fold, all showcasing Google's commitment to AI-driven smartphone technology.

Ars Technica logoTechCrunch logoCNET logo

70 Sources

Technology

21 hrs ago

Google Unveils Pixel 10 Series: AI-Powered Smartphones with

Google Unveils AI-Powered Pixel 10 Smartphones with Gemini Integration

Google launches its new Pixel 10 smartphone series, featuring advanced AI capabilities powered by Gemini, aiming to challenge competitors in the premium handset market.

Bloomberg Business logoThe Register logoReuters logo

24 Sources

Technology

21 hrs ago

Google Unveils AI-Powered Pixel 10 Smartphones with Gemini

Google Unveils Pixel Watch 4: AI-Powered Features and Curved Display Redefine Smartwatch Experience

Google's latest Pixel Watch 4 introduces a curved display, AI-powered health coaching, and satellite communication, setting new standards in the smartwatch market.

TechCrunch logoCNET logoThe Verge logo

19 Sources

Technology

21 hrs ago

Google Unveils Pixel Watch 4: AI-Powered Features and

FieldAI Secures $405M Funding to Revolutionize Robotics with Universal AI Brains

FieldAI, an Irvine-based startup, has raised $405 million to develop "foundational embodied AI models" for various robots, aiming to create adaptable and safe AI systems for real-world applications.

TechCrunch logoReuters logoGeekWire logo

8 Sources

Technology

21 hrs ago

FieldAI Secures $405M Funding to Revolutionize Robotics

Microsoft AI CEO Warns Against the Dangers of 'Seemingly Conscious AI'

Mustafa Suleyman, CEO of Microsoft AI, cautions about the risks of AI systems that appear conscious, urging the industry to avoid creating illusions of sentience in AI products.

CNET logoTechRadar logoSiliconANGLE logo

5 Sources

Technology

21 hrs ago

Microsoft AI CEO Warns Against the Dangers of 'Seemingly
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo