API v1
Rate limits
Rate limits are enforced per API key. Each request consumes 1 credit from your monthly pool (shared with widget chat usage).
Limits by plan
| Plan | Requests / min | Credits / month |
|---|---|---|
| Plus | 60 | 1,500 |
| Enterprise | 200 | Unlimited |
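The two columns interact: on the Plus plan, sending at the full per-minute rate would use up the monthly credit pool quickly. The sketch below works through that arithmetic from the table values. It assumes every request costs exactly 1 credit and that no widget chat usage draws from the same pool; the `PLANS` dictionary and function names are illustrative and not part of the API.

```python
from typing import Optional

# (requests per minute, credits per month); None means unlimited credits.
# Values copied from the plan table; the structure itself is hypothetical.
PLANS = {
    "Plus": (60, 1500),
    "Enterprise": (200, None),
}


def min_interval_seconds(plan: str) -> float:
    """Smallest average gap between requests that stays within the per-minute limit."""
    requests_per_minute, _ = PLANS[plan]
    return 60 / requests_per_minute


def minutes_to_exhaust(plan: str) -> Optional[float]:
    """Minutes of sustained maximum-rate traffic before the monthly credits run out.

    Assumes 1 credit per request and no other usage of the pool.
    Returns None when the plan has unlimited credits.
    """
    requests_per_minute, credits = PLANS[plan]
    if credits is None:
        return None
    return credits / requests_per_minute
```

Under these assumptions, `minutes_to_exhaust("Plus")` evaluates to 25.0, so on that plan the monthly credit pool, rather than the per-minute limit, is likely to be the tighter constraint for sustained traffic.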
Rate limit headers
Every POST /v1/chat response includes three headers to help you manage request pacing:
- X-RateLimit-Limit: maximum requests allowed per minute for your key.
- X-RateLimit-Remaining: requests remaining in the current 60-second window.
- X-RateLimit-Reset: Unix timestamp (seconds) when the rate-limit window resets.
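As a minimal sketch of reading these headers, the helper below takes response headers as a plain mapping of names to string values, which is what most HTTP clients can provide. `RateLimitStatus`, `parse_rate_limit`, and `seconds_until_reset` are hypothetical names chosen for illustration, and the code assumes all three headers are present with integer values.

```python
import time
from typing import Mapping, NamedTuple, Optional


class RateLimitStatus(NamedTuple):
    limit: int      # X-RateLimit-Limit: requests allowed per minute
    remaining: int  # X-RateLimit-Remaining: requests left in this window
    reset_at: int   # X-RateLimit-Reset: Unix timestamp in seconds


def parse_rate_limit(headers: Mapping[str, str]) -> RateLimitStatus:
    """Extract the three rate-limit headers from a response's headers."""
    # HTTP header names are case-insensitive, so normalize before lookup.
    lowered = {name.lower(): value for name, value in headers.items()}
    return RateLimitStatus(
        limit=int(lowered["x-ratelimit-limit"]),
        remaining=int(lowered["x-ratelimit-remaining"]),
        reset_at=int(lowered["x-ratelimit-reset"]),
    )


def seconds_until_reset(status: RateLimitStatus, now: Optional[float] = None) -> float:
    """Seconds left until the window resets; never negative."""
    current = time.time() if now is None else now
    return max(0.0, float(status.reset_at) - current)
```

Passing `now` explicitly keeps the function easy to test; in normal use it can be omitted so the current clock time is used.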
Handling rate limits
- Check X-RateLimit-Remaining before each request to avoid hitting the limit.
- If you receive a 429 response, wait until X-RateLimit-Reset before retrying.
- Implement exponential backoff for retries: start at 1 second, double on each attempt, cap at 30 seconds.
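The guidance above can be sketched as a small retry loop. This is one possible reading, not a prescribed implementation: it waits for whichever is longer, the time until X-RateLimit-Reset or the exponential backoff delay (1 s, 2 s, 4 s, ... capped at 30 s). The `send` callable is a hypothetical stand-in for whatever function performs the POST /v1/chat request and returns a `(status_code, headers)` pair; `max_attempts`, `sleep`, and `clock` are likewise illustrative parameters.

```python
import time
from typing import Callable, Mapping, Tuple

Response = Tuple[int, Mapping[str, str]]  # (status code, response headers)


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 30.0) -> float:
    """Delay before retry `attempt` (0-based): starts at base, doubles, capped."""
    return min(cap, base * (2 ** attempt))


def retry_wait(headers: Mapping[str, str], attempt: int, now: float) -> float:
    """Wait until the window resets, but never less than the backoff delay."""
    lowered = {name.lower(): value for name, value in headers.items()}
    reset = lowered.get("x-ratelimit-reset")
    until_reset = max(0.0, float(reset) - now) if reset is not None else 0.0
    return max(until_reset, backoff_delay(attempt))


def send_with_retry(
    send: Callable[[], Response],
    max_attempts: int = 5,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.time,
) -> Response:
    """Call `send` until it returns something other than 429 or attempts run out."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, headers = send()
        if status != 429:
            return status, headers
        # Sleep only if another attempt will follow.
        if attempt < max_attempts - 1:
            sleep(retry_wait(headers, attempt, clock()))
    return status, headers  # still 429 after the final attempt
```

Injecting `sleep` and `clock` keeps the loop testable without real delays. A caller that also wants to pace requests proactively could check X-RateLimit-Remaining on each successful response and pause until the reset time once it reaches zero.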
Related