Google DeepMind Unveils Gemini Robotics On-Device: A Leap Towards Autonomous AI-Powered Robots

Reviewed by Nidhi Govil

5 Sources

Google DeepMind has released a new on-device AI model for robotics that can operate without cloud connectivity, marking a significant advancement in autonomous robot control and adaptability.

Google DeepMind Introduces Gemini Robotics On-Device

Google DeepMind has released Gemini Robotics On-Device, a new AI model that runs directly on robotic hardware without requiring an internet connection 12. The release is a step towards more autonomous and adaptable robots across a range of applications.

Key Features and Capabilities

The Gemini Robotics On-Device model is a vision-language-action (VLA) system that builds upon the previously released Gemini Robotics model. It offers several notable features:

  1. Local Processing: Unlike its predecessor, which used a hybrid approach combining on-device and cloud-based processing, the new model operates entirely on the robot itself 3.

  2. Offline Functionality: The model enables robots to function in environments with poor or no internet connectivity, making it suitable for use in remote locations or areas with strict security requirements 4.

  3. Rapid Adaptation: According to Carolina Parada, head of robotics at Google DeepMind, the model can adapt to new tasks with as few as 50 to 100 demonstrations 23.

  4. Versatility: Initially trained on Google's ALOHA robot, the model has been successfully adapted to other robot types, including the humanoid Apollo robot from Apptronik and the bi-arm Franka FR3 robot 3.

Performance and Applications

Source: Digit

Google claims that the on-device model performs at a level close to the cloud-based Gemini Robotics model, outperforming other on-device models in general benchmarks 2. Demonstrations have shown robots running this local model performing tasks such as:

  • Unzipping bags
  • Folding clothes
  • Tying shoelaces
  • Pouring liquids 15

The model's ability to generalize and handle new situations makes it particularly promising for applications in manufacturing, logistics, and industrial automation 5.

Development Tools and Safety Measures

To facilitate further development and customization, Google is releasing a Gemini Robotics SDK. This toolkit allows developers to evaluate and fine-tune the model for specific use cases 3. The company is also prioritizing safety in the deployment of this technology:

  1. Multi-layered Approach: The full Gemini Robotics system incorporates reasoning about safe actions, option generation, and low-level controllers for critical safety components 1.

  2. Safety Recommendations: For the on-device model, Google suggests that developers implement safety measures similar to those in the full system, including connecting to the Gemini Live API for an additional safety layer 1.

  3. Semantic Safety Benchmark: The system is being evaluated using a new semantic safety benchmark under the guidance of Google's Responsibility & Safety Council 5.

Future Implications and Industry Context

Source: Interesting Engineering

Because Gemini Robotics On-Device runs locally and adapts to new tasks from relatively few demonstrations, it could have far-reaching implications for several industries as the technology matures:

  1. Manufacturing and Logistics: The model's ability to adapt quickly to new tasks and environments could revolutionize production lines and warehouse operations 5.

  2. Healthcare: Local processing of visual data enhances privacy, making the technology more suitable for sensitive environments like hospitals 1.

  3. Remote Operations: The offline functionality opens up possibilities for robotic applications in areas with limited connectivity, such as disaster response or space exploration 4.

Source: The Verge

As AI continues to advance in the robotics field, other companies are also making strides. Nvidia is developing foundation models for humanoids, while startups like Hugging Face and RLWRLD are working on open models and datasets for robotics 2.

The Gemini Robotics On-Device model and SDK are currently available only to a group of trusted testers 3. Its broader impact on the robotics industry remains to be seen as development and safety assessments continue.

TheOutpost.ai
© 2025 Triveous Technologies Private Limited