Skip to content

[BUG] Non-deterministic quota accounting: 65% of 5-hour session consumed with minimal actual token usage — Max $100 plan #29000

@gudcks0305

Description

@gudcks0305

Environment

  • Plan: Max ($100/month, 5x usage)
  • OS: macOS (Darwin 25.3.0)
  • Date: February 26, 2026

Problem

My 5-hour session quota jumped from ~20% to 65% within a handful of message turns. ccusage data shows modest actual token consumption that does not justify consuming 65% of my session window.

Evidence

ccusage output (2026-02-26):

Date Models Calls Input Output Cache Creation Cache Read
2026-02-26 haiku-4-5, opus-4-6, sonnet-4-6 1,300 9,421 ~2,150K ~29,696K ~31,857K

Dashboard (claude.ai):

  • Current session: 65% used
  • Resets in 30 min

The numbers do not add up. How does ~9,421 input tokens and ~2.15M output tokens translate to 65% of a $100/month Max plan's 5-hour window? Where is the remaining 45% being attributed to?

This directly corroborates the findings in #22435, which demonstrated a 1,500x variance in tokens-per-1%-quota (2,517 to 18,531,900) through 5,396 mitmproxy samples. The quota accounting system is either broken or deliberately opaque.

This is not an isolated incident

This exact problem has been reported and documented for over two months across numerous issues:

Issue Upvotes Description Anthropic Response
#16157 536 Instantly hitting limits with Max subscription "Limits haven't changed"
#23706 59 Opus 4.6 consumes 3x more tokens than 4.5 ❌ None
#22435 - 10x burn rate variance with mitmproxy proof ❌ None
#22876 - 429 errors despite dashboard showing available quota ❌ None
#20767 - Pro quota reduced to ~20 prompts per 5 hours ❌ None (marked stale)
#28927 - Silent billing change in v2.1.51 routing to Extra Usage ❌ None
#17084 237 Most restrictive limits since Opus 4.5 launch ❌ None

The pattern is clear: hundreds of users, months of data, zero meaningful response.

The only official statement

The sole response from Anthropic (ThariqS, Jan 26) was:

"Rate limits have not changed since Opus 4.5 launch."

This is contradicted by:

  1. Empirical data showing 10x burn rate variance on the same account, same day ([BUG] Inconsistent and Undisclosed Quota Accounting Changes in Claude Max Plan. Legal liability claim ready. #22435)
  2. Community-estimated ~60% reduction in effective token limits (The Register)
  3. The automatic upgrade to Opus 4.6, which benchmarks show generates 3x more tokens for identical tasks — effectively a stealth limit reduction without touching the "rate limit" number
  4. Silent billing changes in v2.1.51 routing 200K+ context calls to Extra Usage ([BUG] Silent billing change in v2.1.51: 1M context moved to extra-usage-only without notice — JSONL data proves identical workload billed differently #28927)

Saying "limits haven't changed" while deploying a model that consumes 3x more tokens against unchanged limits is a limit reduction. This is not a technical nuance — it's basic arithmetic.

A word on "Anthropic"

The name "Anthropic" derives from the Greek anthropikos — "of or pertaining to humanity." The company's stated mission is to build AI that is "safe, beneficial, and understandable."

What is neither beneficial nor understandable:

  • Charging $100–200/month (Max plan) for a service with undisclosed, non-deterministic quota accounting
  • Ignoring months of documented evidence from paying customers, including mitmproxy captures, JSONL billing analysis, and API header data
  • Marking legitimate bug reports as "stale" instead of addressing them
  • Auto-upgrading users to a 3x more token-hungry model without disclosure or consent, then claiming "limits haven't changed"
  • Silently changing billing behavior in patch updates (v2.1.51) without changelog entries
  • Moderating criticism on Discord while issues pile up on GitHub unanswered

When developers present 5,396 API samples proving quota inconsistency and the response is silence — or worse, dismissal as "resumption of normal limits" — this is not a company living up to its name. This is consumer deception.

Users in #22435 have already outlined potential violations under UCC 2-313, FTC Act Section 5, and California Business & Professions Code 17200. The legal exposure grows with every week of silence.

What we need — not want, need

  1. Publish the quota consumption formula. If quota is based on something other than raw token count, disclose it. Today.
  2. Explain the variance. Why does 1% quota cost 2,517 tokens in one session and 18,531,900 in another?
  3. Real-time quota visibility in Claude Code CLI — not a hidden settings page on claude.ai
  4. Opt-in model upgrades. Do not auto-upgrade to models that burn quota 3x faster
  5. Acknowledge the problem exists. Silence is not a strategy. It is a choice, and it speaks louder than any PR statement.

Related Issues

#16157 #17084 #20767 #22435 #22439 #22876 #23706 #27310 #28927

Metadata

Metadata

Labels

area:costbugSomething isn't workingduplicateThis issue or pull request already existsplatform:macosIssue specifically occurs on macOS

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions