Rate limits
We limit the number of API requests that can be made to Vmodel AI:
- You can create predictions at 600 requests per minute.
- All other endpoints you can call at 3000 requests per minute.
If you hit a limit, you will receive a response with status 429
with a body like:
json
{
"detail": "Too many requests. Please try again in 60 seconds"
}
If you want higher limits, Contact us