Question 1

What is the difference between token bucket and leaky bucket?

Accepted Answer

Token bucket allows bursts up to the bucket capacity, then enforces a sustained rate equal to the refill rate. Leaky bucket drains at a fixed rate regardless of arrival pattern, smoothing output to a steady pace and giving no burst headroom.

Question 2

How big should the bucket capacity be?

Accepted Answer

Large enough to absorb normal burst patterns like page-load fan-out, mobile app startup, or retry storms, and small enough that a bad actor cannot cause harm with a single burst. A common starting point is 5-10 times the per-second refill rate.

Question 3

Should limits be per user or per IP?

Accepted Answer

Per authenticated key or user is fairest and most accurate. Per IP is a useful fallback for unauthenticated traffic but fails when many users share one IP (corporate NAT, university network).

Question 4

What response headers should accompany a 429?

Accepted Answer

Return X-RateLimit-Limit (the bucket capacity), X-RateLimit-Remaining (tokens left), X-RateLimit-Reset (when the bucket refills), and Retry-After (seconds until the client should retry).

Question 5

How do I handle distributed rate limiting across multiple servers?

Accepted Answer

In-process token buckets don't coordinate across instances. Use a shared store like Redis with atomic increment operations, or a dedicated rate-limit service, to enforce a global limit consistently.

API Rate Limit Calculator

Frequently Asked Questions

API Rate Limit Calculator

Frequently Asked Questions

Related Developer Tools Calculators