429 Too Many Requests before the handler runs, so your
integration sees the throttle even when the upstream service is healthy.
Default limits
| Limit | Default value |
|---|---|
| Steady-state rate | 10 requests / second per key |
| Burst | 20 requests |
What a throttled response looks like
Retry-After value is the suggested number of seconds to wait before retrying. Build
your client to honour it; aggressive retries without backoff will keep you throttled.
Recommendations
- Cache reads on your side when you can —
GET /v1/conversation/{id}is the most common source of preventable load. - Use exponential backoff with jitter on every
429and on503. - Spread out batch jobs; the per-second rate limit applies regardless of how the requests cluster.
- Treat the limits as a contract, not a soft target. If your integration grows past them, request an increase ahead of the change.