-
Notifications
You must be signed in to change notification settings - Fork 18.1k
[BUG] Non-deterministic quota accounting: 65% of 5-hour session consumed with minimal actual token usage — Max $100 plan #29000
Description
Environment
- Plan: Max ($100/month, 5x usage)
- OS: macOS (Darwin 25.3.0)
- Date: February 26, 2026
Problem
My 5-hour session quota jumped from ~20% to 65% within a handful of message turns. ccusage data shows modest actual token consumption that does not justify consuming 65% of my session window.
Evidence
ccusage output (2026-02-26):
| Date | Models | Calls | Input | Output | Cache Creation | Cache Read |
|---|---|---|---|---|---|---|
| 2026-02-26 | haiku-4-5, opus-4-6, sonnet-4-6 | 1,300 | 9,421 | ~2,150K | ~29,696K | ~31,857K |
Dashboard (claude.ai):
- Current session: 65% used
- Resets in 30 min
The numbers do not add up. How does ~9,421 input tokens and ~2.15M output tokens translate to 65% of a $100/month Max plan's 5-hour window? Where is the remaining 45% being attributed to?
This directly corroborates the findings in #22435, which demonstrated a 1,500x variance in tokens-per-1%-quota (2,517 to 18,531,900) through 5,396 mitmproxy samples. The quota accounting system is either broken or deliberately opaque.
This is not an isolated incident
This exact problem has been reported and documented for over two months across numerous issues:
| Issue | Upvotes | Description | Anthropic Response |
|---|---|---|---|
| #16157 | 536 | Instantly hitting limits with Max subscription | "Limits haven't changed" |
| #23706 | 59 | Opus 4.6 consumes 3x more tokens than 4.5 | ❌ None |
| #22435 | - | 10x burn rate variance with mitmproxy proof | ❌ None |
| #22876 | - | 429 errors despite dashboard showing available quota | ❌ None |
| #20767 | - | Pro quota reduced to ~20 prompts per 5 hours | ❌ None (marked stale) |
| #28927 | - | Silent billing change in v2.1.51 routing to Extra Usage | ❌ None |
| #17084 | 237 | Most restrictive limits since Opus 4.5 launch | ❌ None |
The pattern is clear: hundreds of users, months of data, zero meaningful response.
The only official statement
The sole response from Anthropic (ThariqS, Jan 26) was:
"Rate limits have not changed since Opus 4.5 launch."
This is contradicted by:
- Empirical data showing 10x burn rate variance on the same account, same day ([BUG] Inconsistent and Undisclosed Quota Accounting Changes in Claude Max Plan. Legal liability claim ready. #22435)
- Community-estimated ~60% reduction in effective token limits (The Register)
- The automatic upgrade to Opus 4.6, which benchmarks show generates 3x more tokens for identical tasks — effectively a stealth limit reduction without touching the "rate limit" number
- Silent billing changes in v2.1.51 routing 200K+ context calls to Extra Usage ([BUG] Silent billing change in v2.1.51: 1M context moved to extra-usage-only without notice — JSONL data proves identical workload billed differently #28927)
Saying "limits haven't changed" while deploying a model that consumes 3x more tokens against unchanged limits is a limit reduction. This is not a technical nuance — it's basic arithmetic.
A word on "Anthropic"
The name "Anthropic" derives from the Greek anthropikos — "of or pertaining to humanity." The company's stated mission is to build AI that is "safe, beneficial, and understandable."
What is neither beneficial nor understandable:
- Charging $100–200/month (Max plan) for a service with undisclosed, non-deterministic quota accounting
- Ignoring months of documented evidence from paying customers, including mitmproxy captures, JSONL billing analysis, and API header data
- Marking legitimate bug reports as "stale" instead of addressing them
- Auto-upgrading users to a 3x more token-hungry model without disclosure or consent, then claiming "limits haven't changed"
- Silently changing billing behavior in patch updates (v2.1.51) without changelog entries
- Moderating criticism on Discord while issues pile up on GitHub unanswered
When developers present 5,396 API samples proving quota inconsistency and the response is silence — or worse, dismissal as "resumption of normal limits" — this is not a company living up to its name. This is consumer deception.
Users in #22435 have already outlined potential violations under UCC 2-313, FTC Act Section 5, and California Business & Professions Code 17200. The legal exposure grows with every week of silence.
What we need — not want, need
- Publish the quota consumption formula. If quota is based on something other than raw token count, disclose it. Today.
- Explain the variance. Why does 1% quota cost 2,517 tokens in one session and 18,531,900 in another?
- Real-time quota visibility in Claude Code CLI — not a hidden settings page on claude.ai
- Opt-in model upgrades. Do not auto-upgrade to models that burn quota 3x faster
- Acknowledge the problem exists. Silence is not a strategy. It is a choice, and it speaks louder than any PR statement.
Related Issues
#16157 #17084 #20767 #22435 #22439 #22876 #23706 #27310 #28927