Home

simple-rate-limiter

A TypeScript-based Express rate limiter middleware supporting both sliding window and token bucket algorithms for per-IP request limiting. Designed for transparency, observability, and extensibility.

Documentation is published with GitHub Pages at: https://amarpreetbhatia.github.io/simple-rate-limiter

Generate locally with npm run docs:build and open the output in docs/.

Features

  • Algorithm selection – Supports both sliding window and token bucket modes
  • Per-IP Rate Limiting – Identifies clients by IP address (customizable)
  • Configurable Storage – In-memory store provided; bring your own Redis/Memcached
  • Observability Hooks – Metrics and monitoring integration support
  • Standard Headers – Sends X-RateLimit-* and Retry-After headers
  • Safe Defaults – Fails open; allows traffic if store fails
  • TypeScript Support – Full type safety and IDE autocomplete

Installation

npm install express
npm install --save-dev @types/express typescript ts-node

Once the package is published, install it with:

npm install simple-rate-limiter

Quick Start

import express from 'express';
import { createRateLimiter } from './index';

const app = express();

const limiter = createRateLimiter({
  windowMs: 60_000,      // 60 seconds
  maxRequests: 100,      // 100 requests per window
  headers: true,         // Enable rate limit headers
});

app.use(limiter);

app.get('/', (req, res) => {
  res.send('OK');
});

app.listen(3000);

Configuration

RateLimiterConfig

Option Type Default Description
algorithm 'sliding-window' | 'token-bucket' sliding-window Select which rate limiting algorithm to use
windowMs number required Sliding window interval or token bucket evaluation window in ms
maxRequests number required Allowed requests per window or token bucket capacity
tokenBucket TokenBucketConfig none Optional token bucket settings when using token-bucket
keyGenerator (req) => string req.ip Function to derive client key
skip (req) => boolean none Skip rate limiting for specific requests
headers boolean | object false Enable standard rate limit headers
store RateLimiterStore InMemoryStore Custom storage backend
metrics RateLimiterMetrics none Observability hooks
logger RateLimiterLogger console Custom logger instance
onLimitReached (req, res, info) => void none Callback when limit first reached
onBlocked (req, res, info) => void none Callback when request is blocked

Usage Examples

Basic Setup

const limiter = createRateLimiter({
  windowMs: 15 * 60 * 1000, // 15 minutes
  maxRequests: 100,
});

app.use(limiter);

With Custom Key Generator

Rate limit by user ID instead of IP:

const limiter = createRateLimiter({
  windowMs: 60_000,
  maxRequests: 50,
  keyGenerator: (req) => req.user?.id || req.ip,
});

Skip Rate Limiting for Specific Routes

const limiter = createRateLimiter({
  windowMs: 60_000,
  maxRequests: 100,
  skip: (req) => req.path === '/health' || req.path === '/status',
});

With Metrics Integration

const limiter = createRateLimiter({
  windowMs: 60_000,
  maxRequests: 100,
  metrics: {
    recordAllowed: (req, info) => {
      // Send to Prometheus, Datadog, etc.
      prometheus.counter('rate_limiter_allowed_total', 1);
    },
    recordBlocked: (req, info) => {
      prometheus.counter('rate_limiter_blocked_total', 1);
    },
    recordCurrentUsage: (req, info) => {
      prometheus.gauge('rate_limiter_current_requests', info.currentRequests);
    },
  },
});

With Custom Logger

import { RateLimiterLogger } from './index';

class CustomLogger implements RateLimiterLogger {
  log(...args: any[]): void {
    // Use your own logging system
    myLogger.info(...args);
  }

  warn(...args: any[]): void {
    myLogger.warn(...args);
  }

  error(...args: any[]): void {
    myLogger.error(...args);
  }
}

const limiter = createRateLimiter({
  windowMs: 60_000,
  maxRequests: 100,
  logger: new CustomLogger(), // Optional; defaults to console
});

Algorithm Selection

const limiter = createRateLimiter({
  algorithm: 'token-bucket',
  windowMs: 60_000,
  maxRequests: 100,
  tokenBucket: {
    bucketSize: 100,
    refillRate: 1, // one token per second
  },
});

Custom Response on Block

const limiter = createRateLimiter({
  windowMs: 60_000,
  maxRequests: 100,
  onBlocked: (req, res, info) => {
    res.status(429).json({
      error: 'Rate limit exceeded',
      retryAfter: Math.ceil(info.resetInMs / 1000),
    });
  },
});

With Callbacks

const limiter = createRateLimiter({
  windowMs: 60_000,
  maxRequests: 100,
  onLimitReached: (req, res, info) => {
    console.warn(`Limit reached for ${info.key}`);
    // Send alert, log, etc.
  },
  onBlocked: (req, res, info) => {
    console.error(`Request blocked for ${info.key}`);
    res.status(429).send('Too many requests');
  },
});

With Custom Store (Redis Example)

import redis from 'redis';

const redisClient = redis.createClient();

const customStore: RateLimiterStore = {
  async get(key: string) {
    const data = await redisClient.get(key);
    return data ? JSON.parse(data) : null;
  },
  async set(key: string, entry: any) {
    await redisClient.set(key, JSON.stringify(entry), 'EX', 3600);
  },
  async reset(key: string) {
    await redisClient.del(key);
  },
};

const limiter = createRateLimiter({
  windowMs: 60_000,
  maxRequests: 100,
  store: customStore,
});

Response Headers

When headers: true, the middleware adds:

  • X-RateLimit-Limit – Total requests allowed in the window
  • X-RateLimit-Remaining – Remaining requests in current window
  • X-RateLimit-Reset – Unix timestamp when window resets (seconds)
  • Retry-After – Seconds to wait before retrying (only when blocked)
HTTP/1.1 200 OK
X-RateLimit-Limit: 100
X-RateLimit-Remaining: 95
X-RateLimit-Reset: 1622548234

Error Responses

When a request is blocked:

{
  "error": "Too Many Requests",
  "retryAfter": 45
}

Status code: 429 Too Many Requests

Running the Example

# Build the TypeScript
npm run build

# Run the example server
npm run dev

The server will start on http://localhost:3000 with a 10 requests/60 seconds limit.

Test with:

# Should succeed
curl http://localhost:3000/

# Make 10 requests
for i in {1..10}; do curl http://localhost:3000/; done

# 11th request should be blocked
curl http://localhost:3000/

API Reference

createRateLimiter(config: RateLimiterConfig): RequestHandler

Factory function that returns an Express middleware.

Types

interface TokenBucketConfig {
  bucketSize?: number; // Maximum tokens in the bucket
  refillRate?: number; // Tokens replenished per second
}

interface RateLimitInfo {
  key: string;                    // Client identifier
  windowMs: number;               // Window size or evaluation interval
  maxRequests: number;            // Request limit or bucket capacity
  currentRequests: number;        // Used requests or consumed tokens
  remainingRequests: number;      // Remaining allowed requests or available tokens
  resetInMs: number;              // ms until the next reset or token refill
}

interface RateLimiterLogger {
  log(...args: any[]): void;      // General logging
  warn(...args: any[]): void;     // Warning logging
  error(...args: any[]): void;    // Error logging
}

interface RateLimiterStore {
  get(key: string): Promise<RateLimiterEntry | null>;
  set(key: string, entry: RateLimiterEntry): Promise<void>;
  reset(key: string): Promise<void>;
}

Algorithms

This middleware supports two rate limiting algorithms:

  • sliding-window: ideal for request quotas over a moving time window.
  • token-bucket: ideal for smoothing traffic and allowing controlled bursts.

Sliding window behavior

  • The middleware tracks request timestamps.
  • It counts requests in the interval [now - windowMs, now].
  • If the count exceeds maxRequests, the request is blocked.

Token bucket behavior

  • Each client has a bucket of tokens.
  • The bucket refills at refillRate tokens per second.
  • Each request consumes one token.
  • If tokens are unavailable, the request is blocked until tokens replenish.

Error Handling

The middleware implements fail-open semantics:

  • If the store fails to respond, the request is allowed.
  • An error is logged for monitoring.
  • This ensures the rate limiter doesn't become a point of failure.

Performance Considerations

  • In-memory store: Suitable for single-instance deployments; avoid using it for very high client counts.
  • Distributed store: Use Redis/Memcached for multi-instance deployments.
  • Cleanup: In-memory store periodically removes stale entries to prevent memory growth.

License

MIT