Google Gemini usage limits shift to compute-based

Google Gemini moves to compute-based usage limits

Google has fundamentally altered how it measures consumption across its Gemini AI service, shifting from a daily prompt-based system to a compute-based system that calculates usage based on task complexity, features used, and chat length 2

. The change, announced at I/O 2026, brings Google Gemini in line with competitors like ChatGPT and Claude, which have long used token consumption models rather than simple prompt counts 2

Source: Gadgets 360

Under the new compute-used model, activities like video generation, deep research, and coding consume significantly more resources than basic text prompts 2

. The system now features both five-hour and weekly usage limits—users who consume too much compute hit a five-hour limit first, with weekly caps following 2

. Premium models and features, including media generation for images, videos, and music, as well as Deep Research Pro Model with extended thinking capabilities, eat into Gemini quota faster than standard tasks 2

Tiered plans offer vastly different compute allowances

The new structure creates stark differences between subscription tiers. Users without a plan receive standard limits, while the AI Plus plan at $8 per month provides 2x standard capacity 2

. The $20-per-month AI Pro plan delivers 4x higher limits, and the newly introduced $100 AI Ultra plan offers 20x the standard usage compared to free tier access 2

. AI Pro plan subscribers and Ultra users can purchase pay-as-you-go AI Credits to bypass limits once they're reached, with these credits working across Google Antigravity, Google Flow, and the Gemini app 2

Antigravity users hit limits within hours, sparking immediate backlash

The implementation of token quotas triggered immediate user frustration, particularly among developers relying on Google's AI-powered coding tool Antigravity for software engineering tasks 3

. Some Antigravity users discovered they could hit their limits within just an hour of working, a drastic reduction from previous allowances 3

. On Reddit, frustrated paid subscribers accused Google of executing a bait-and-switch, claiming the tighter Gemini usage limits made the AI Pro plan feel unnecessarily restrictive for users actively paying for the service 5

Source: 9to5Google

Google responds with triple quota increases and new model

Responding to the backlash, Google tripled Gemini model rate limits for Antigravity on Wednesday and reset weekly quotas for all paid plans 3

. When user frustration persisted, Google tripled the weekly quota again just days later—a 9x total increase across both adjustments 3

. Varun Mohan, a Director within DeepMind working on Antigravity, acknowledged users could hit weekly limits "after a couple work sessions" before announcing the second increase 3

Google also introduced Gemini 3.5 Flash Low, a new model designed to consume even fewer tokens than the successful Gemini 3.5 Flash 1

. When users hit their limits, Google now switches them to a smaller model automatically so they can continue working 2

. However, the higher quotas apply only inside Antigravity, while broader Gemini usage caps across other tools remain unchanged 5

Source: Android Authority

Industry-wide struggle with agentic AI demands

Google's adjustments reflect broader challenges facing AI providers as they grapple with increasingly powerful agentic features that can spawn sub-agents consuming tens of thousands of tokens over multiple turns from a single request 4

. GitHub recently overhauled its Copilot plans, switching from "premium request units" to AI Credits based on actual tokens used 4

. Anthropic doubled Claude Code limits for its Claude Pro and Max plans after securing additional compute capacity through a deal with SpaceX, with an executive admitting current plans "weren't built" for features like Claude Code and Cowork 4

Despite Google's rapid response with multiple quota increases, many users maintain that current limits remain lower than what was available before the original changes, suggesting the backlash may continue as developers and power users evaluate whether the Gemini Pro plan still meets their needs for coding, deep research, and extended workflows 5

Google Gemini switches to compute-based limits as user backlash forces quota increases

Google Gemini moves to compute-based usage limits

Tiered plans offer vastly different compute allowances

Antigravity users hit limits within hours, sparking immediate backlash

Google responds with triple quota increases and new model

Industry-wide struggle with agentic AI demands

References

Google's latest attempt to fix token quotas is here: Say hello to Gemini 3.5 Flash Low

Google is changing how Gemini usage limits work

Google has tripled Gemini usage limits for Antigravity, twice

Google just made big changes to Gemini usage limits

Google gives Antigravity users another major Gemini quota boost as backlash refuses to die down

Related Stories

Google fixes Gemini usage limits after single prompt maxed out subscriber's entire quota

Google separates Gemini 3 usage limits, giving Pro and Thinking models independent quotas

Google Unveils Enhanced Gemini 2.5 Pro: A Leap Forward in AI Capabilities

Recent Highlights

Anthropic warns AI may soon build itself, calls for global pause on frontier development

Apple finally launches Siri AI overhaul with Google Gemini, two years after initial promise

AI-designed vaccine passes first human trial, offering broad protection against coronaviruses

Recent Highlights

Today's Top Stories

Jeff Bezos' Prometheus AI startup raises $12B at $41B valuation to build engineering tools

FIFA World Cup 2026 becomes most technologically advanced tournament with AI at its core

Mother Sues OpenAI Over ChatGPT's Role in Daughter's Death, Citing Deliberate Design Decisions

OpenAI considers drastic price cuts as AI price war with Anthropic intensifies