Question 1

How does the Claude Code 5-hour rolling window work?

Accepted Answer

The 5-hour rolling window tracks your usage over the most recent 5 hours continuously, not in fixed blocks. If you send 45 messages between 9am and 10am on the Pro plan, you hit your limit. But as time passes, older messages age out of the window. By 2pm, those messages are outside the 5-hour window and your capacity is fully restored. The window rolls continuously, so if you space your usage across 5 hours instead of concentrated in one hour, you can use Claude Code throughout the day without hitting limits.

Question 2

What happens when I reach my Claude Code usage limit?

Accepted Answer

When you reach your usage limit on a subscription plan, you are never charged extra. Instead, Claude Code may throttle your responses (making them slower), redirect you to a lighter model like Haiku, or temporarily pause responses until capacity frees up in your rolling window. On the Pro plan, throttling is more aggressive - you might wait several minutes between responses or be limited to shorter responses. On Max plans, throttling is more gradual and you maintain access to your chosen model for longer.

Question 3

Are Claude Code usage limits based on messages or tokens?

Accepted Answer

Usage limits are measured in usage units, not simple message counts. The approximate message numbers (45 for Pro, 225 for Max 5x, 900 for Max 20x) assume typical message complexity. Longer messages that involve reading large files or generating extensive code consume more usage units per message than short questions. This means a session of complex multi-file operations will hit the limit faster than a session of quick questions. The /cost command in Claude Code can help you track your current usage level.

Question 4

How do Claude Code API rate limits work?

Accepted Answer

API rate limits are separate from subscription usage limits and are measured in requests per minute (RPM) and tokens per minute (TPM). New API accounts start at Tier 1 with lower limits, and you can increase your tier by maintaining a consistent spending history. Tier 1 allows 50 RPM and 40,000 TPM for Sonnet. Tier 4 allows 4,000 RPM and 400,000 TPM. Rate limit increases happen automatically based on your account age and spending, or you can request a tier increase through the Anthropic console.

Question 5

Can I check my current usage level in Claude Code?

Accepted Answer

Yes, use the /cost command in Claude Code to see your current session costs and usage level. For subscription users, this shows how much of your current 5-hour window capacity you have consumed. For API users, it shows the total tokens and cost for your current session. Monitoring your usage regularly helps you develop an intuition for how quickly different types of tasks consume your allocation, making it easier to plan your work within the limits.

Plan	Approx Msgs / 5hr	Usage Multiplier	Context Window	Monthly Price
Pro	~45	1x	200K	$20
Max 5x	~225	5x	1M	$100
Max 20x	~900	20x	1M	$200

Tier	Requests/Min	Tokens/Min (Input)	Tokens/Min (Output)
Tier 1 (New)	50	40,000	8,000
Tier 2	1,000	80,000	16,000
Tier 3	2,000	160,000	32,000
Tier 4	4,000	400,000	80,000

Claude Code Usage Limits: What You Get on Each Plan

How Usage Limits Work

Limits by Plan

What Happens When You Hit the Limit

API Rate Limits

Tips to Stay Within Limits

Understanding Limit Resets

Frequently Asked Questions

How does the Claude Code 5-hour rolling window work?

What happens when I reach my Claude Code usage limit?

Are Claude Code usage limits based on messages or tokens?

How do Claude Code API rate limits work?

Can I check my current usage level in Claude Code?

Need Higher Limits?