Google DeepMind Plans to Merge Gemini and Veo AI Models for Enhanced Real-World Understanding

2 Sources

Google DeepMind CEO Demis Hassabis reveals plans to combine Gemini AI with Veo video generator, aiming to create a universal digital assistant with improved real-world understanding. The move highlights the industry trend towards versatile "omni" AI models.

News article

Google DeepMind's Vision for a Universal Digital Assistant

Google DeepMind CEO Demis Hassabis has unveiled ambitious plans to merge the company's Gemini AI models with its Veo video-generating models. This strategic move aims to enhance Gemini's understanding of the physical world, bringing Google closer to its vision of creating a "universal digital assistant" 1.

During an appearance on the "Possible" podcast, co-hosted by LinkedIn co-founder Reid Hoffman, Hassabis explained, "We've always built Gemini, our foundation model, to be multimodal from the beginning, and the reason we did that [is because] we have a vision for this idea of a universal digital assistant, an assistant that [...] actually helps you in the real world" 2.

The Rise of "Omni" AI Models

The planned merger of Gemini and Veo reflects a broader industry trend towards developing versatile "omni" models capable of understanding and synthesizing multiple forms of media. These advanced AI systems can process and generate various types of content, including text, images, audio, and video 1.

Google's latest Gemini models already demonstrate multimodal capabilities, generating audio, images, and text. Similarly, OpenAI's ChatGPT now includes native image creation features. Amazon has also announced plans to launch an "any-to-any" model later this year, further highlighting the industry's direction 1.

Leveraging YouTube for AI Training

A key aspect of Google's strategy involves using YouTube as a primary source of training data for its AI models. Hassabis revealed that Veo 2, the latest iteration of their video-generating model, learns about real-world physics by processing vast amounts of YouTube content 2.

"Basically, by watching YouTube videos -- a lot of YouTube videos -- [Veo 2] can figure out, you know, the physics of the world," Hassabis explained 1. This approach allows the AI to gain a deeper understanding of real-world dynamics and interactions.

Data Usage and Privacy Considerations

Google's use of YouTube content for AI training raises questions about data usage and creator agreements. The company has previously stated that its models "may be" trained on "some" YouTube content, in accordance with its agreements with creators 1.

Reports suggest that Google broadened its terms of service last year, potentially to allow for expanded use of data in AI model training. This move highlights the ongoing debate surrounding data privacy and the ethical use of user-generated content in AI development 2.

Implications for the Future of AI

The planned integration of Gemini and Veo models represents a significant step towards creating more sophisticated and versatile AI systems. By combining language understanding with visual comprehension, Google aims to develop AI assistants that can better interact with and understand the physical world 12.

This advancement could lead to more intuitive and capable AI applications across various sectors, from personal assistance to industrial automation. However, it also underscores the need for continued discussions on data privacy, ethical AI development, and the potential societal impacts of increasingly advanced AI systems.

Explore today's top stories

Google Unveils Pixel 10 Series: AI-Powered Features and Camera Upgrades Take Center Stage

Google has launched its new Pixel 10 series, featuring improved AI capabilities, camera upgrades, and the new Tensor G5 chip. The lineup includes the Pixel 10, Pixel 10 Pro, and Pixel 10 Pro XL, with prices starting at $799.

Ars Technica logoTechCrunch logoCNET logo

60 Sources

Technology

14 hrs ago

Google Unveils Pixel 10 Series: AI-Powered Features and

Google Unveils AI-Powered Pixel 10 Smartphones with Advanced Gemini Features

Google launches its new Pixel 10 smartphone series, showcasing advanced AI capabilities powered by Gemini, aiming to compete with Apple in the premium handset market.

Bloomberg Business logoThe Register logoReuters logo

22 Sources

Technology

13 hrs ago

Google Unveils AI-Powered Pixel 10 Smartphones with

NASA and IBM Unveil Surya: An AI Model to Predict Solar Flares and Space Weather

NASA and IBM have developed Surya, an open-source AI model that can predict solar flares and space weather with improved accuracy, potentially helping to protect Earth's infrastructure from solar storm damage.

New Scientist logoengadget logoGizmodo logo

6 Sources

Technology

22 hrs ago

NASA and IBM Unveil Surya: An AI Model to Predict Solar

Google Unveils Pixel Watch 4: A Leap Forward in AI-Powered Wearables

Google's latest smartwatch, the Pixel Watch 4, introduces significant upgrades including a curved display, AI-powered features, and satellite communication capabilities, positioning it as a strong competitor in the smartwatch market.

TechCrunch logoCNET logoZDNet logo

18 Sources

Technology

13 hrs ago

Google Unveils Pixel Watch 4: A Leap Forward in AI-Powered

FieldAI Secures $405M Funding to Revolutionize Robot Intelligence with Physics-Based AI Models

FieldAI, a robotics startup, has raised $405 million to develop "foundational embodied AI models" for various robot types. The company's innovative approach integrates physics principles into AI, enabling safer and more adaptable robot operations across diverse environments.

TechCrunch logoReuters logoGeekWire logo

7 Sources

Technology

14 hrs ago

FieldAI Secures $405M Funding to Revolutionize Robot
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo