Google Gemini switches to compute-based limits as user backlash forces quota increases

Reviewed byNidhi Govil

13 Sources

Share

Google announced a fundamental shift in how it calculates Gemini usage limits, moving from daily prompt counts to a compute-based system that factors in task complexity. The change sparked immediate user frustration, particularly among developers using Antigravity for coding. Google responded by tripling Antigravity limits twice within days and introducing Gemini 3.5 Flash Low, but many users claim quotas remain lower than before.

Google Gemini moves to compute-based usage limits

Google has fundamentally altered how it measures consumption across its Gemini AI service, shifting from a daily prompt-based system to a compute-based system that calculates usage based on task complexity, features used, and chat length

2

. The change, announced at I/O 2026, brings Google Gemini in line with competitors like ChatGPT and Claude, which have long used token consumption models rather than simple prompt counts

2

.

Source: Gadgets 360

Source: Gadgets 360

Under the new compute-used model, activities like video generation, deep research, and coding consume significantly more resources than basic text prompts

2

. The system now features both five-hour and weekly usage limits—users who consume too much compute hit a five-hour limit first, with weekly caps following

2

. Premium models and features, including media generation for images, videos, and music, as well as Deep Research Pro Model with extended thinking capabilities, eat into Gemini quota faster than standard tasks

2

.

Tiered plans offer vastly different compute allowances

The new structure creates stark differences between subscription tiers. Users without a plan receive standard limits, while the AI Plus plan at $8 per month provides 2x standard capacity

2

4

. The $20-per-month AI Pro plan delivers 4x higher limits, and the newly introduced $100 AI Ultra plan offers 20x the standard usage compared to free tier access

2

4

. AI Pro plan subscribers and Ultra users can purchase pay-as-you-go AI Credits to bypass limits once they're reached, with these credits working across Google Antigravity, Google Flow, and the Gemini app

2

.

Antigravity users hit limits within hours, sparking immediate backlash

The implementation of token quotas triggered immediate user frustration, particularly among developers relying on Google's AI-powered coding tool Antigravity for software engineering tasks

3

5

. Some Antigravity users discovered they could hit their limits within just an hour of working, a drastic reduction from previous allowances

3

. On Reddit, frustrated paid subscribers accused Google of executing a bait-and-switch, claiming the tighter Gemini usage limits made the AI Pro plan feel unnecessarily restrictive for users actively paying for the service

5

.

Source: 9to5Google

Source: 9to5Google

Google responds with triple quota increases and new model

Responding to the backlash, Google tripled Gemini model rate limits for Antigravity on Wednesday and reset weekly quotas for all paid plans

3

. When user frustration persisted, Google tripled the weekly quota again just days later—a 9x total increase across both adjustments

3

1

. Varun Mohan, a Director within DeepMind working on Antigravity, acknowledged users could hit weekly limits "after a couple work sessions" before announcing the second increase

3

.

Google also introduced Gemini 3.5 Flash Low, a new model designed to consume even fewer tokens than the successful Gemini 3.5 Flash

1

. When users hit their limits, Google now switches them to a smaller model automatically so they can continue working

2

. However, the higher quotas apply only inside Antigravity, while broader Gemini usage caps across other tools remain unchanged

5

.

Source: Android Authority

Source: Android Authority

Industry-wide struggle with agentic AI demands

Google's adjustments reflect broader challenges facing AI providers as they grapple with increasingly powerful agentic features that can spawn sub-agents consuming tens of thousands of tokens over multiple turns from a single request

4

. GitHub recently overhauled its Copilot plans, switching from "premium request units" to AI Credits based on actual tokens used

4

. Anthropic doubled Claude Code limits for its Claude Pro and Max plans after securing additional compute capacity through a deal with SpaceX, with an executive admitting current plans "weren't built" for features like Claude Code and Cowork

4

.

Despite Google's rapid response with multiple quota increases, many users maintain that current limits remain lower than what was available before the original changes, suggesting the backlash may continue as developers and power users evaluate whether the Gemini Pro plan still meets their needs for coding, deep research, and extended workflows

5

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved