4 Sources
[1]
Early look: Gemini Omni generates realistic AI video in new leak
Metadata suggests the model is an extension of Google's "Veo," and user reports indicate it carries heavy usage limits on the Google AI Pro plan. Google I/O 2026 is next week, and we're already seeing glimpses of upcoming features. Gemini users have spotted the new animation on iOS and Android apps, but that's not all. A user now reports spotting references to a new "Gemini Omni" model, which could be Google's next-generation AI video model. Reddit user Zacatac_391 opened their Gemini app and got a pop-up suggesting they "create with Gemini Omni," Google's "new video model" (h/t 9to5Google). This video is not perfect, but still impressive on several counts. The Omni model not only got the reasoning correct but also generated a video that is very lifelike and reasonably accurate. There are some tells that it's an AI video, like some of the writing actions not matching the chalk output and the vanishing chalk towards the end (which shows inconsistency), but overall, it's still pretty good. This video is a little less impressive. The spaghetti appears out of thin air on empty plates, and there's not enough chewing happening for the bites taken. For reference, Reddit user janekm3 tried the same prompt with ByteDance's Seedance 2 and got the following result: Seedance 2's output appears more consistent, though the video jitters on my end. Reddit user Zacatac_391 notes that they've spotted the new section for usage showing precise rolling limits. The user had some Flash usage, but beyond that, with just two video generation requests on the Google AI Pro plan, they reached 86% of their daily usage limit. Max Weinbach on X found metadata suggesting that Omni is an extension of Veo. We'll have to wait for Google's official announcement to learn more about Gemini Omni and how it improves upon Veo.
[2]
Gemini 'Omni' video model shows up with some early demos
A new video generation model is apparently coming to Gemini, with "Omni" producing some pretty impressive inital results. Video generation is perhaps one the most impressive, but also most polarizing aspects of generative AI. Google has built out Veo as its video generation model for a while now, but seems to have something new in the pipeline. At least one Gemini user was prompted to "Create with Gemini Omni," which Google describes as follows: Meet our new video generation model. Remix your videos, edit directly in chat, try a template, and more. How "Omni" fits into the broader context of Gemini and Veo isn't entirely clear at the moment, but metadata suggests "Omni" is an extension of Veo. But, regardless of that, the output here looks pretty impressive. One demo used input of "a professor writes out a mathematical proof for trigonometric identities on a traditional chalkboard, explaining the step he is currently on in the equation," and while there are still some obvious tells in the final output, the video does a great job of handling the text while putting out a fairly realistic video. Meanwhile, a second prompt asked for a scene of two men eating spaghetti - in reference to the Will Smith test - again with fairly realistic results. It's nothing entirely groundbreaking, but the output is quite good. The prompt here was: "Can you create a scene with two men at a table seaside at an upscale restaurant on outdoor deck seating. They are at a circular table with a nice white table cloth, and all of the fancy accessories, all the spoons forks and knives, fancy napkins, centerpiece. One man is Distinguished: A mature African-American man in his 50s with a short beard and confident posture, wearing a tailored, sophisticated suit, the other is is friend, both approaching the table to eat a plate of spaghetti. In the beginning the men approach the table, exchange brief niceties, and begin to eat the spaghetti calmly In between bites sharing conversation." A "usage" tab also showed up for this user, with these two prompts taking up 86% of daily usage on an AI Pro plan (though the user did say some usage on Gemini Flash during the same day). We recently spotted Google's intention to add more explicit usage limits. Google hasn't announced Gemini "Omni" yet, but previously said that "video's here to stay" in commitment to the technology following the announcement that OpenAI would kill off video generation through its Sora model earlier this year. With I/O 2026 right around the corner, that's probably where we'll hear more about Google's plans for Gemini and video generation.
[3]
What the Leaked Gemini Omni Demo Reveals About Google's AI Future
Google's Gemini Omni has emerged as a focal point in the AI community following a series of leaks that hint at its advanced capabilities. According to AI Grid, the model is expected to unify text, image and video generation into a single framework, with early demos showcasing its real-time multi-modal integration. However, the leaked details also highlight significant challenges, such as its high computational demands, which could limit accessibility for smaller creators. These trade-offs have sparked widespread discussion about whether Omni can balance its technical achievements with practical usability. Dive into this exposé to explore the implications of Omni's potential release. Gain insight into how its rumored features, such as real-time video generation and cross-platform integration, could reshape creative workflows. Understand the cost-performance dynamics that may influence its adoption and see how Omni compares to competitors like Sea Dance 2 and Cling 3.0. With Google I/O 2026 on the horizon, this breakdown offers a timely lens into what Omni might mean for the future of AI. The excitement surrounding Gemini Omni began when users of the Gemini app noticed a "powered by Omni" label, hinting at the model's integration into existing platforms. Shortly after, videos allegedly generated by Omni surfaced online, showcasing its remarkable video generation capabilities. These clips demonstrated a level of quality and realism that suggests Omni could represent a significant leap forward in AI technology. Although Google has not officially confirmed these details, the leak offers a tantalizing preview of what the model might achieve. The leaked videos have fueled speculation about Omni's potential to outperform existing models in terms of multi-modal functionality. This early exposure has set high expectations, with many eagerly awaiting further details at Google I/O 2026. Gemini Omni appears to unify multiple modalities, text, image and video, into a single, cohesive framework. This integration could simplify workflows for developers, content creators and businesses by reducing the need for separate tools. Unlike its predecessors, Omni is rumored to support real-time multi-modal generation, a feature that could set it apart from competitors like Sea Dance 2, Alibaba W 2.7 and Cling 3.0. If these claims hold true, Omni would represent a significant improvement over Google's current V3.1 model, which lacks seamless multi-modal integration. By offering a unified platform, Omni could streamline creative processes, making it an attractive option for professionals seeking efficiency and versatility in their AI tools. Learn more about Google Gemini with other articles and guides we have written below. While Omni's capabilities are undeniably impressive, they come with notable trade-offs. The model's high computational demands have already raised concerns about its accessibility and cost-effectiveness. For instance, users on the $20/month Pro plan reported that generating just two videos consumed 86% of their daily allowance. This suggests that Omni's operational costs could significantly exceed those of earlier models like V3.1 and Sora 2. For businesses and individual creators, the challenge will be finding a balance between Omni's performance benefits and its potential cost implications. Larger enterprises may find the model's capabilities worth the investment, but smaller creators could struggle to justify the expense. Addressing these cost barriers will be crucial for making sure widespread adoption. The AI industry is becoming increasingly competitive, with numerous advanced models vying for dominance. Sea Dance 2, for example, has set a high standard for video quality and multi-modal capabilities. If Omni delivers on its promises, it could position Google as a leader in AI video generation and multi-modal integration. However, success will depend on more than just technological innovation. Google must also address cost efficiency and usage limitations, as these factors will heavily influence adoption rates. By striking the right balance, Omni could carve out a unique position in the crowded AI landscape, appealing to both enterprise users and independent creators. The upcoming Google I/O event is expected to provide critical insights into Omni's role within Google's broader AI strategy. Industry experts have outlined three potential scenarios for Omni's positioning: The third scenario is the most ambitious and, if realized, could set a new benchmark for the AI industry. By consolidating multiple functionalities into a single platform, Omni could redefine how creators and developers interact with AI tools, offering unprecedented flexibility and efficiency. The concept behind Omni echoes the rumored "Omni" variant of GPT-4, which was never officially released but promised similar multi-modal capabilities. By integrating text, image and video generation into a unified model, Omni could fulfill the vision of an all-encompassing AI system. However, its high costs and usage limitations may present significant challenges, particularly for smaller creators and businesses. Striking a balance between innovation and accessibility will be critical for Omni's long-term success. If Google can address these challenges effectively, Omni could become a cornerstone of the AI industry, influencing the development of future models and applications. As the AI community eagerly awaits Google's official announcement, the potential impact of Gemini Omni is becoming increasingly clear. Whether it redefines multi-modal AI or serves as an incremental improvement, Omni is poised to influence the trajectory of AI development in meaningful ways. The upcoming Google I/O event will likely provide crucial details, clarifying how this new model fits into the broader AI landscape and what it means for the future of artificial intelligence. Disclosure: Some of our articles include affiliate links. If you buy something through one of these links, Geeky Gadgets may earn an affiliate commission. Learn about our Disclosure Policy.
[4]
Gemini Omni leak reveals Google's next AI video tool ahead of I/O 2026
The feature may debut around Google I/O 2026, where Gemini and AI are expected to take center stage. Google may soon expand Gemini AI capabilities by introducing a new video generation system, reportedly called Omni. While the company has not officially announced the feature yet, early previews shared by some users suggest that Google is testing advanced AI-powered video creation directly inside Gemini. As per the reports, select Gemini users recently started seeing a Create with Gemini Omni option within the chatbot. The feature is described as a new video generation model capable of creating, editing and remixing videos via simple chat prompts. The users may also be able to use templates and modify videos directly inside the Gemini interface. It must be noted that Google has clarified how Omni fits alongside its existing Veo video generation technology; metadata reportedly indicates that the new system can be built on top of Veo. Early demo clips shared online suggest that the quality of generated videos is improving, specifically in handling realistic movement, facial expressions and text rendering. One example showcased a classroom-style scene featuring a professor solving trigonometry equations on a chalkboard while explaining the proof step by step. Another prompt created a cinematic restaurant scene involving two men eating spaghetti at a seaside dining setup. However, some AI generated imperfections were still visible but overall, the result looked more polished. The report also suggested that Gemini Omni may include daily usage limits depending on subscription plans. One user claimed that generating two detailed videos consumed a major portion of their daily AI Pro plans. Google has recently been exploring clearer usage tracking and restrictions for Gemini's advanced AI tools. This comes ahead of Google IO 2026, where the company is said to heavily focus on Gemini and AI. Google previously hinted that video creation would remain a major long-term focus for Gemini, especially after increasing competition in the generative AI space.
Share
Copy Link
Google's unannounced Gemini Omni video generation model has surfaced through early user demos, showing impressive capabilities in creating realistic AI video content. The leaked footage reveals an AI-powered video generation tool that handles complex prompts with lifelike results, though users report hitting 86% of their daily limits after just two videos on the $20-per-month Google AI Pro plan.
Google appears ready to expand its AI video generation capabilities with a new model called Gemini Omni, which has surfaced through early user access just days before Google I/O 2026. Select Gemini chatbot users discovered a prompt inviting them to "Create with Gemini Omni," described by Google as a new video generation model that enables users to remix videos, edit directly in chat, and work with templates
1
2
. The timing suggests Google may unveil this AI-powered video generation tool at its flagship developer conference next week, where Gemini and AI innovations are expected to dominate the agenda4
.
Source: Digit
Metadata analysis indicates that Gemini Omni functions as an extension of Veo, Google's existing video generation technology, though the company has not clarified exactly how the two systems relate
1
4
. This leaked Gemini Omni demo arrives as Google doubles down on its commitment to video creation within generative AI, particularly following OpenAI's decision to discontinue its Sora model earlier this year2
.The early demonstrations of Gemini Omni reveal notable progress in creating realistic AI video content. One user tested the model with a complex prompt requesting "a professor writes out a mathematical proof for trigonometric identities on a traditional chalkboard, explaining the step he is currently on in the equation"
2
. The resulting video generation model from Google produced footage that not only captured correct mathematical reasoning but also generated lifelike visuals with reasonably accurate text rendering1
.However, some AI-generated imperfections remain visible. In the chalkboard demonstration, certain writing actions didn't match the chalk output, and the chalk appeared to vanish inconsistently toward the end
1
. A second test involving two men eating spaghetti at a seaside restaurant showed the food appearing unexpectedly on empty plates, with insufficient chewing motions for the bites taken1
. When compared to ByteDance's Seedance 2 using the same prompt, that model produced more consistent output, though with noticeable video jitter1
.One of the most striking revelations from the leaked access involves usage limits on the Google AI Pro plan, which costs $20-per-month. A user reported that after generating just two videos with Gemini Omni, they had consumed 86% of their daily usage allowance
1
2
. This suggests significant computational demands behind the model's operation, raising concerns about cost-effectiveness and accessibility for individual creators and smaller businesses3
.
Source: Android Authority
The high consumption rate indicates that Google's next AI video tool may require substantially more resources than earlier models like Veo 3.1 and OpenAI Sora 2
3
. For businesses and independent creators, balancing Omni's performance benefits against potential cost implications will be critical. While larger enterprises may justify the investment, smaller creators could struggle with the expense, making widespread adoption dependent on how Google addresses these usage limits3
.Related Stories
Industry analysis suggests that Gemini Omni may unify text, image, and video generation into a single framework, offering real-time multi-modal integration that could distinguish it from competitors like Seedance 2, Alibaba's models, and Cling 3.0 . This consolidation could streamline creative workflows by eliminating the need for separate tools, making it attractive for developers and content creators seeking efficiency .

Source: Geeky Gadgets
Looking ahead to Google I/O 2026, experts have outlined three potential scenarios for how Omni fits into Google's AI future: as a standalone premium offering, as a replacement for existing Veo models, or as a unified platform consolidating multiple AI functionalities
3
. The third scenario represents the most ambitious path, potentially setting a new benchmark for how creators and developers interact with AI tools. The concept echoes the rumored multi-modal variant of GPT-4 that was never officially released, suggesting Google may be pursuing capabilities that competitors have explored but not fully delivered3
. As the AI video generation landscape grows increasingly competitive, success will depend not just on technological innovation but also on addressing cost efficiency and practical usability for diverse user segments.Summarized by
Navi
[1]
[2]
[3]
19 May 2026•Technology

18 May 2026•Technology

13 Feb 2025•Technology

1
Technology

2
Policy and Regulation

3
Policy and Regulation
