Rate Limiting
Overview
While we are in beta, Rate limits are applied per operation type, with each having its own limit and burst capacity. Exceeding these limits will result in a 429 Too Many Requests
response.
If you consistently need higher limits, please contact us.
Rate Limits by Operation Type
We apply different rate limits based on the type of operation you're performing:
Operation Type | Limit | Burst Capacity | Window |
---|---|---|---|
Read | 1,200 | 1,000 | 1 minute |
List | 600 | 200 | 1 minute |
Write | 360 | 120 | 1 minute |
Update | 480 | 160 | 1 minute |
Delete | 240 | 80 | 1 minute |
Handling Rate Limits
When you hit a rate limit:
- You'll receive a
429 Too Many Requests
response - The response will include a
Retry-After
header - Wait for the specified time before retrying
Example error response:
Last updated: October 9, 2025
Introduction
The Mixedbread API enables powerful text embeddings, semantic search, and document intelligence capabilities for AI-powered applications.
Pagination
Understanding cursor-based pagination across Mixedbread API endpoints. Learn how to navigate through large result sets efficiently using cursors, handle pagination parameters, and implement robust pagination logic.