Expected Latencies
| Configuration | Typical Latency |
|---|---|
| Fast | 75-150ms |
| Standard | 400-1400ms |
extraction_mode=fast and/or resolution_mode=fast to enable fast mode.
If Consistently Slow
API latency exceeds expectations
- Check network latency - API is hosted on Fly.io (global regions)
- Cache responses - Same queries return same results
-
Use fast mode - Set
extraction_mode=fastandresolution_mode=fastfor lowest latency - Check your message length - Longer messages take more time to extract keywords from