API v1

Rate limits

Rate limits are enforced per API key. Each request consumes 1 credit from your monthly pool (shared with widget chat usage).

Limits by plan

PlanRequests / minCredits / month
Plus601,500
Enterprise200Unlimited

Rate limit headers

Every POST /v1/chat response includes three headers to help you manage request pacing:

  • X-RateLimit-Limit — maximum requests allowed per minute for your key.
  • X-RateLimit-Remaining — requests remaining in the current 60-second window.
  • X-RateLimit-Reset — Unix timestamp (seconds) when the rate-limit window resets.

Handling rate limits

  1. Check X-RateLimit-Remaining before each request to avoid hitting the limit.
  2. If you receive a 429 response, wait until X-RateLimit-Reset before retrying.
  3. Implement exponential backoff for retries — start at 1 second, double on each attempt, cap at 30 seconds.

Related