Skip to content

Rate limits

We limit the number of API requests that can be made to Vmodel AI:

  • You can create predictions at 600 requests per minute.
  • All other endpoints you can call at 3000 requests per minute.

If you hit a limit, you will receive a response with status 429 with a body like:

json
{
  "detail": "Too many requests. Please try again in 60 seconds"
}

If you want higher limits, Contact us