Mixedbread

Rate Limiting

Overview

While we are in beta, Rate limits are applied per operation type, with each having its own limit and burst capacity. Exceeding these limits will result in a 429 Too Many Requests response.

If you consistently need higher limits, please .

Rate Limits by Operation Type

We apply different rate limits based on the type of operation you're performing:

Operation TypeLimitBurst CapacityWindow
Read1,2001,0001 minute
List6002001 minute
Write3601201 minute
Update4801601 minute
Delete240801 minute

Handling Rate Limits

When you hit a rate limit:

  1. You'll receive a 429 Too Many Requests response
  2. The response will include a Retry-After header
  3. Wait for the specified time before retrying

Example error response:

{
    "type": "too_many_requests_error",
    "url": "https://www.mixedbread.com/api-reference",
    "message": "Rate limit exceeded. Please try again later.",
    "details": {
        "retry_after": 60,
        "limit": "1200",
        "remaining": "0",
        "reset": "1730000000"
    }
}
Last updated: October 9, 2025