What Is API Rate Limiting? Protect Your App from Overload
Is a user sending thousands of requests per second? Bot traffic crashing your server? Rate limiting caps the number of requests to your API, protecting both security and performance.
What Is Rate Limiting?
Rate limiting restricts the number of requests a client can make within a time window. When exceeded, the API returns HTTP 429 (Too Many Requests).
Why Rate Limit?
- DDoS protection — Block malicious traffic
- Resource protection — Prevent server/database overload
- Fair usage — Equal service for all users
- Cost control — Limit third-party API expenses
Rate Limiting Algorithms
1. Fixed Window
Count requests in fixed time windows (e.g., per minute). Simple to implement, but traffic can spike at window boundaries: a client can use a full quota at the end of one window and another full quota at the start of the next.
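A minimal in-memory sketch of a fixed-window counter (the window length, limit, and Map-based storage are illustrative assumptions, not any particular library's API):

// Fixed window: one counter per client, reset when a new window starts
const WINDOW_MS = 60 * 1000; // 1-minute window (assumed for illustration)
const LIMIT = 100;           // max requests per window (assumed)
const counters = new Map();

function allowRequest(clientId, now = Date.now()) {
  const entry = counters.get(clientId);
  // Start a fresh window if none exists or the current one has expired
  if (!entry || now - entry.windowStart >= WINDOW_MS) {
    counters.set(clientId, { windowStart: now, count: 1 });
    return true;
  }
  if (entry.count < LIMIT) {
    entry.count += 1;
    return true;
  }
  return false; // over the limit: respond with HTTP 429
}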
2. Sliding Window
Counts requests over a rolling window (commonly implemented with a Redis sorted set of request timestamps), which smooths out the boundary spikes of the fixed window.
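One way to sketch this is a sorted set per client with request timestamps as scores; the key format, window length, and limit below are assumptions, and the example uses the ioredis client:

import Redis from 'ioredis';

const redis = new Redis();    // assumes a reachable Redis instance
const WINDOW_MS = 60 * 1000;  // sliding 1-minute window (assumed)
const LIMIT = 100;            // max requests per window (assumed)

async function allowRequest(clientId) {
  const key = `ratelimit:sliding:${clientId}`;
  const now = Date.now();

  // Drop timestamps that have slid out of the window
  await redis.zremrangebyscore(key, 0, now - WINDOW_MS);
  // Count the requests still inside the window
  const count = await redis.zcard(key);
  if (count >= LIMIT) return false;

  // Record this request and let the key expire once the client goes idle
  await redis.zadd(key, now, `${now}:${Math.random()}`);
  await redis.pexpire(key, WINDOW_MS);
  return true;
}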
3. Token Bucket
Tokens refill at a constant rate and each request consumes one. The bucket's capacity allows short bursts while the refill rate enforces the long-term average.
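A small in-memory token bucket sketch (the capacity and refill rate are assumed values):

// Token bucket: capacity allows bursts, the refill rate enforces the average
const CAPACITY = 10;        // maximum burst size (assumed)
const REFILL_PER_SEC = 5;   // tokens added per second (assumed)
const buckets = new Map();

function allowRequest(clientId, now = Date.now()) {
  const bucket = buckets.get(clientId) ?? { tokens: CAPACITY, lastRefill: now };
  // Refill based on the time elapsed since the last check
  const elapsedSec = (now - bucket.lastRefill) / 1000;
  bucket.tokens = Math.min(CAPACITY, bucket.tokens + elapsedSec * REFILL_PER_SEC);
  bucket.lastRefill = now;
  buckets.set(clientId, bucket);

  if (bucket.tokens >= 1) {
    bucket.tokens -= 1;  // consume one token for this request
    return true;
  }
  return false;          // bucket empty: reject with 429
}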
4. Leaky Bucket
Requests are processed at a constant rate; excess requests are queued or rejected, so outgoing traffic stays smooth no matter how bursty the input is.
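A rough leaky-bucket sketch using a bounded queue drained on a timer (the queue size and drain interval are assumed):

// Leaky bucket: requests queue up and "leak" out at a fixed rate
const QUEUE_CAPACITY = 20;       // maximum queued requests (assumed)
const DRAIN_INTERVAL_MS = 100;   // process one request every 100 ms (assumed)
const queue = [];

function enqueueRequest(handler) {
  if (queue.length >= QUEUE_CAPACITY) return false; // bucket overflows: reject with 429
  queue.push(handler);
  return true;
}

// Drain at a constant rate regardless of how fast requests arrive
setInterval(() => {
  const next = queue.shift();
  if (next) next();
}, DRAIN_INTERVAL_MS);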
Express Middleware
import rateLimit from 'express-rate-limit';

// Allow at most 100 requests per client IP in a 15-minute window
const limiter = rateLimit({
  windowMs: 15 * 60 * 1000,  // window length: 15 minutes
  max: 100,                  // requests allowed per window
  standardHeaders: true,     // send RateLimit-* headers in every response
  legacyHeaders: false,      // omit the deprecated X-RateLimit-* headers
  message: { error: 'Too many requests. Try again in 15 minutes.' }
});

// Apply the limiter to every route under /api/
app.use('/api/', limiter);
Nginx Rate Limiting
# Shared-memory zone "api" (10 MB), keyed by client IP, at 10 requests/second
limit_req_zone $binary_remote_addr zone=api:10m rate=10r/s;

location /api/ {
    # Allow bursts of up to 20 extra requests and serve them without delay
    limit_req zone=api burst=20 nodelay;
    # Return 429 instead of the default 503 when the limit is exceeded
    limit_req_status 429;
}
Response Headers
HTTP/1.1 429 Too Many Requests
RateLimit-Limit: 100
RateLimit-Remaining: 0
Retry-After: 45
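On the client side, a 429 can be handled by waiting for the number of seconds given in Retry-After before trying again. A simple sketch using fetch (the retry count and fallback delay are assumptions):

async function fetchWithRetry(url, attempts = 3) {
  for (let i = 0; i < attempts; i += 1) {
    const res = await fetch(url);
    if (res.status !== 429) return res;

    // Retry-After is given in seconds here; fall back to 1 second if absent
    const retryAfter = Number(res.headers.get('Retry-After') ?? '1');
    await new Promise((resolve) => setTimeout(resolve, retryAfter * 1000));
  }
  throw new Error(`Still rate limited after ${attempts} attempts: ${url}`);
}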
Tiered Limits
| Level | Scope | Example |
|----------|-----------------|-----------------------|
| Global | Entire API | 10,000 req/min |
| User | Per user | 100 req/min |
| IP | Per IP | 50 req/min |
| Endpoint | Per route | /login: 5 req/min |
| Plan | By subscription | Free: 100, Pro: 1000 |
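Several of these tiers can be combined by deriving the limit and key from each request. The sketch below assumes a recent express-rate-limit version where max and keyGenerator accept functions, and that an earlier auth middleware has attached req.user (with id and plan fields) and that loginHandler exists:

import rateLimit from 'express-rate-limit';

// Plan tier: Free users get 100 req/min, Pro users 1000; counted per user, falling back to IP
const planLimiter = rateLimit({
  windowMs: 60 * 1000,
  max: (req) => (req.user?.plan === 'pro' ? 1000 : 100),
  keyGenerator: (req) => req.user?.id ?? req.ip,
  standardHeaders: true
});

// Endpoint tier: a stricter limit on a sensitive route
const loginLimiter = rateLimit({ windowMs: 60 * 1000, max: 5 });

app.use('/api/', planLimiter);
app.post('/login', loginLimiter, loginHandler);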
Best Practices
- Informative responses — Include the Retry-After header
- Layered limits — Global + user + endpoint-specific
- Whitelist — Exempt trusted internal services
- Graceful degradation — Degrade quality before hard rejecting
- Monitor — Track rate limit triggers and set up alerts
- Distributed counting — Use Redis for centralized counters across servers (see the sketch below)
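With multiple application servers, per-process counters drift apart, so the count needs to live in shared storage. A minimal fixed-window counter on Redis using INCR plus an expiry (key format, window, and limit are assumed):

import Redis from 'ioredis';

const redis = new Redis();   // assumes a Redis instance shared by all app servers
const WINDOW_SECONDS = 60;   // window length (assumed)
const LIMIT = 100;           // max requests per window (assumed)

async function allowRequest(clientId) {
  const key = `ratelimit:shared:${clientId}`;
  // Atomically increment the shared counter
  const count = await redis.incr(key);
  // The first request in a window sets the expiry so the counter resets itself
  if (count === 1) {
    await redis.expire(key, WINDOW_SECONDS);
  }
  return count <= LIMIT;
}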
Conclusion
Rate limiting is fundamental to API security and stability. The right algorithm protects against abuse while ensuring fair access for legitimate users.
Learn API security and rate limiting on LabLudus.