Google Gemini Set to Introduce Video Upload and Generation Features

3 Sources

Share

Google's AI assistant Gemini is poised to expand its capabilities with new video-related features, including video upload analysis and AI-powered video generation, as revealed by recent APK teardowns.

News article

Google Gemini Prepares to Launch Video Upload and Generation Features

Google's AI assistant, Gemini, is on the verge of introducing two significant video-related features, as uncovered by recent APK teardowns. These developments could potentially close the gap between Gemini and its competitor, ChatGPT, while expanding the AI's capabilities in handling and creating video content.

Video Upload and Analysis Feature

Evidence from the latest beta version of the Google app (16.9.39.sa.arm64) suggests that Gemini will soon support video uploads for analysis

1

2

. Currently, Gemini can process various file types, including web pages, images, PDFs, and YouTube video links, but direct video file uploads are not yet possible

2

.

The discovered code strings reference "attached video file," "play video file," and "video file length," indicating that users may soon be able to upload videos directly to Gemini's chat interface

2

3

. While the feature is still under development, researchers were able to attach videos to Gemini's chat, although the AI was not yet capable of analyzing the content

2

.

AI-Powered Video Generation

In addition to video analysis, evidence suggests that Gemini is developing an AI-powered video generation feature, codenamed "Toucan"

1

. This feature would allow users to create videos by simply providing text descriptions of their ideas

1

.

Key points about the video generation feature include:

  1. Potential daily limits on video generation, as indicated by a code string mentioning a "Toucan generation limit"

    1

    .
  2. The possibility of a dedicated AI model for video generation, potentially utilizing Google's Veo 2 model introduced last year

    1

    .
  3. An estimated processing time of a few minutes for user prompts

    1

    .

Implications and Potential Applications

The introduction of these features could significantly enhance Gemini's functionality and user experience. Potential applications include:

  1. Summarizing recorded lectures or meetings
  2. Analyzing security camera footage
  3. Assessing social media content
  4. Troubleshooting technical issues through screen recordings

    2

Rollout and Availability

While exact release dates for these features remain unknown, Google's recent pattern of frequent Gemini updates suggests they may arrive sooner rather than later

2

. It's unclear whether these features will be available to all users or limited to Gemini Advanced subscribers

1

2

.

As Google continues to expand Gemini's capabilities and integrate it into various aspects of the Android ecosystem, these new video-related features represent another step in the ongoing development of AI assistants and their ability to process and generate multimedia content

2

3

.

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo