TL;DR
- Quota Revision: Google isrevising Gemini’s paid limits after complaints that one failed video task could consume a full five-hour window.
- Policy Change: Google now caps how much one Gemini 3.1 Pro request can use, and failed jobs no longer count against quota.
- Subscriber Test: AI Pro buyers still need the refreshed quota system to feel predictable when heavier Omni and video features are part of normal use.
Google is revising Gemini’s compute-based usage limits after complaints that expensive tasks could burn through quota too quickly.
One failed avatar-video request reported by a user on X exhausted his entire five-hour usage window, turning a subscription update into a direct product problem for Google AI Pro subscribers. Google Gemini lead Josh Woodward offered a public response, stating “Yikes, let us take a look!”
This is crazy man… one prompt + 4 minutes and I hit my 5 hour rate limit in the Gemini app. Here is the proof. Yesterday, I hit the rate limit the same way, so I wanted to check if it would happen again.
I started with 0% usage on my 5 hour limit, then gave one simple prompt… pic.twitter.com/pqlPZKRoek
— AshutoshShrivastava (@ai_for_success) May 25, 2026
Paying users had already seen Google push Gemini toward compute-based quotas through the wider subscription reset in May. The earlier Gemini 3.1 Pro rollout had also expanded the premium stack, leaving Google’s paid plans under pressure to make heavier workloads feel predictable enough for subscribers who use video and other costly tools.
How Gemini’s New Quota Rules Changed
Quota now depends on how demanding a task is, not just how many prompts a user sends. Google’s paid setup reflects prompt complexity and tool usage, with limits that refresh every five hours until a weekly cap is reached.
Under that model, Google is also limiting how much quota a single Gemini 3.1 Pro prompt can consume.
For paying users, failed requests now do not count against quota, a change that directly addresses the risk that an expensive attempt could wipe out hours of paid access without producing a usable result.
Google currently lists these three Google AI plans:
Across those tiers, Google AI Plus includes limited access to Gemini Omni Flash, while AI Pro broadens access across the Gemini app and related tools. AI Ultra adds higher limits plus Deep Think access.
Paid video generation sits near the center of the dispute because heavier creative tasks reveal the real cost of a compute-based plan much faster than an ordinary chat session.
For subscribers, that means quota math now shapes product value as much as the feature list. A plan can look generous on paper yet still feel restrictive if one failed generation drains most of a session before the user gets a result.
Product Rollout Context and Feature Pressure
Pressure on the new quota model rose after Google expanded Gemini’s heavier feature set in May. With the Gemini Omni launch for subscribers, paid users gained direct access to a multimodal video tool that can turn quota drain into a visible product failure.
In response, Google has fixed a bug that could let one or two Omni videos consume too much quota for some users. Google also doubled the number of Omni generations available to AI Ultra subscribers, a sign that the response reached beyond a wording change around limits and into capacity for the heaviest tier.
At the lower end of the stack, Flash-Lite prompts are free. Ashutosh Shrivastava’s description turns the policy change into a concrete usability test: a paid plan still has to let an AI Pro user finish a demanding task without losing an entire session to a failed run. That is especially important for mid-tier subscribers, who are paying for broader access but still face much tighter ceilings than AI Ultra users.
What Comes Next for Paid Gemini Use
At the top tier, AI Ultra advertises a 5X higher usage limit in the Gemini app and Google Antigravity than Google’s Pro plan. For Google, that gap creates room to push the heaviest workloads toward a pricier tier while leaving AI Pro buyers to judge whether the mid-tier plan remains practical.
Google’s latest fixes reduce the sharpest complaint by shielding failed jobs from quota loss and tightening how much one request can consume. AI Pro subscribers still need the five-hour window to feel usable in practice, especially when video features and other compute-heavy tools are now central to what Google is selling in Gemini.

