Conventions / Rate limits

Rate limits

Limits are a flat per-minute quota, not tiered by plan. They protect the API from runaway loops, with plenty of headroom for normal use.

100

requests / minute

REST API

Per API key

240

requests / minute

MCP server

Per credential

requests / minute

Public endpoints

Per IP

The window is a 60 second sliding window. Authenticated limits are keyed by API key, so two keys behind one IP each get a full quota, and one key used across many IPs still shares a single quota.

When you hit a limit

Over quota, the API returns 429 with the error code rate_limited. The Retry-After response header carries the number of seconds until the window resets. Sleep that long, then retry.

429 Too Many Requests

Retry-After: 23

{
  "error": {
    "type": "rate_limit_error",
    "code": "rate_limited",
    "message": "Too many requests. Retry after 23 seconds."
  }
}