Google Expands Veo 3 AI Video Generation to Gemini App: Photos to Videos Now Possible

Reviewed by Nidhi Govil

Google has added a new feature to its Gemini AI app, allowing users to transform photos into short video clips using the Veo 3 AI video generator. This expansion brings advanced AI video creation capabilities to a broader audience.

Google Introduces Photo-to-Video Generation in Gemini App

Google has announced a significant update to the Gemini AI app, introducing the ability to transform static photos into dynamic video clips using the Veo 3 AI video generator. This feature, previously available in Google's Flow AI tool for filmmakers, is now being integrated into the more widely accessible Gemini platform [1].

Feature Availability and Functionality

Source: Mashable

The new photo-to-video capability is rolling out to Google AI Ultra and Pro plan subscribers in select regions. It is available immediately in the web version of Gemini, with the mobile app rollout expected over the course of the week [5].

To create a video, users select the "Video" option from the Gemini toolbar, upload a photo, and provide a text description of the desired animation. The system also allows for audio descriptions, including dialogue, sound effects, and ambient noise, which Google claims will be "perfectly synced with the visuals" [3].

Technical Specifications and Limitations

The Veo 3-powered system generates 8-second video clips in MP4 format, with a resolution of 720p and a 16:9 landscape aspect ratio. Users are currently limited to three video creations per day, with no carry-over allowance [2].
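As a rough illustration of what these limits mean in practice, the sketch below works out the frame size implied by 720p at 16:9 and models a three-per-day allowance in which unused generations are not carried forward. This is not Google's implementation: the names frame_size and DailyQuota are hypothetical, and the assumption that the allowance resets per calendar day is ours, since the article does not say when the reset happens.

```python
from dataclasses import dataclass, field
from datetime import date

DAILY_LIMIT = 3  # from the article: three video creations per day


def frame_size(height: int = 720, aspect_w: int = 16, aspect_h: int = 9) -> tuple[int, int]:
    """Return (width, height) in pixels for a given height and aspect ratio."""
    return (height * aspect_w // aspect_h, height)


@dataclass
class DailyQuota:
    """Hypothetical per-day allowance with no carry-over.

    Assumption: the allowance resets each calendar day.
    """
    limit: int = DAILY_LIMIT
    used: dict = field(default_factory=dict)  # maps date -> generations used

    def remaining(self, day: date) -> int:
        # Only the given day's usage counts; earlier days never add credit.
        return self.limit - self.used.get(day, 0)

    def try_generate(self, day: date) -> bool:
        # Refuse once the day's allowance is spent.
        if self.remaining(day) <= 0:
            return False
        self.used[day] = self.used.get(day, 0) + 1
        return True
```

For example, a user who creates only one video on a given day would still have exactly three available the next day, not five.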

Google CEO Sundar Pichai reported that since Veo 3's launch seven weeks ago, users have created more than 40 million videos across the Gemini app and Flow tool, highlighting the rapid adoption of this technology [2].

AI Detection and Ethical Considerations

Source: engadget

In response to growing concerns about the authenticity of AI-generated content, Google has implemented both visible and invisible watermarking systems. All videos generated using the Veo 3 model will display a visible "Veo" watermark and include an invisible SynthID digital watermark [4].

These measures aim to help identify AI-generated content, addressing potential misuse concerns. Google has also released a tool to detect content containing SynthID, further enhancing transparency in AI-generated media [2].

Implications and Future Developments

Source: Android Police

The integration of Veo 3 into Gemini brings advanced AI video generation to a much wider audience. This development could change how content is created across various industries, from social media to digital marketing [1].

However, the technology's rapid advancement also raises questions about the future of visual media authenticity. Recent incidents, such as the flood of AI-generated racist videos on TikTok allegedly created using Veo 3, underscore the need for robust ethical guidelines and detection mechanisms as these tools become more widespread [4].
