Core Concepts
Rate limits
Design your integration around throughput limits, and recover gracefully when requests are throttled.
Handling limits
| Scenario | Recommended behavior |
|---|---|
| HTTP 429 | Retry with exponential backoff and jitter. |
| Burst traffic | Queue or batch lower-priority work. |
| Background jobs | Spread large jobs over time using workers. |
| User-facing chat | Show a graceful retry UI and preserve the user's draft. |
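
As a rough illustration of the HTTP 429 row, the sketch below retries a call with exponential backoff and "full" jitter. It is a minimal example under stated assumptions, not an official client: `RateLimitError`, `backoff_delay`, `call_with_retry`, and the default values for `base`, `cap`, and `max_attempts` are all hypothetical names and numbers chosen for illustration. It also assumes the caller converts a 429 response into `RateLimitError` and, if the server sends a `Retry-After` value, passes it along in seconds.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical error the caller raises when a response is HTTP 429."""

    def __init__(self, retry_after=None):
        super().__init__("rate limited (HTTP 429)")
        # Seconds the server asked us to wait, if it said so (assumption).
        self.retry_after = retry_after


def backoff_delay(attempt, base=1.0, cap=60.0, rng=random.random):
    """Full-jitter delay: a random value in [0, min(cap, base * 2**attempt))."""
    return rng() * min(cap, base * (2 ** attempt))


def call_with_retry(send, max_attempts=5, base=1.0, cap=60.0, sleep=time.sleep):
    """Call send() until it succeeds or max_attempts is exhausted."""
    for attempt in range(max_attempts):
        try:
            return send()
        except RateLimitError as err:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            delay = backoff_delay(attempt, base, cap)
            if err.retry_after is not None:
                # Wait at least as long as the server requested.
                delay = max(delay, err.retry_after)
            sleep(delay)
```

Usage might look like `call_with_retry(lambda: client.send(request))`, where `client.send` is a placeholder for whatever function issues the request and raises `RateLimitError` on a 429. Jitter spreads retries from many clients over time so they do not all return at the same moment, and the cap keeps the wait bounded after repeated failures.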