| title | API error code 503 - The engine is currently overloaded | |
|---|---|---|
| description | Resolve the W&B Inference 503 "engine overloaded" error returned during periods of high traffic on the API. | |
| keywords | | |
A 503 error with the message "The engine is currently overloaded, please try again later" means the W&B Inference server is experiencing high traffic and cannot process your request right now.
This overload is a transient condition that typically resolves on its own as traffic subsides.
- Retry after a short delay
  - Wait a few seconds before retrying your request.
  - Use exponential backoff to avoid adding to the congestion.
- Spread out requests
  - If you're sending many requests, space them out over time.
  - Implement request queuing to smooth traffic spikes.
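The two suggestions above can be combined in a small client-side helper. The following is a minimal sketch, not an official W&B client feature: `OverloadedError`, `backoff_delays`, `call_with_backoff`, and `send_spaced` are hypothetical names, and `send_request` stands for any zero-argument callable that performs your API call and raises `OverloadedError` when it receives a 503. The delay values are illustrative defaults, not recommended settings.

```python
import random
import time


class OverloadedError(Exception):
    """Hypothetical stand-in for whatever your client raises on a 503."""


def backoff_delays(max_retries=5, base_delay=1.0, max_delay=30.0):
    """Yield capped exponential delays: base, 2*base, 4*base, ..."""
    for attempt in range(max_retries):
        yield min(max_delay, base_delay * (2 ** attempt))


def call_with_backoff(send_request, max_retries=5, base_delay=1.0,
                      max_delay=30.0, sleep=time.sleep, rng=random.random):
    """Call send_request(), retrying on OverloadedError up to max_retries times."""
    for delay in backoff_delays(max_retries, base_delay, max_delay):
        try:
            return send_request()
        except OverloadedError:
            # Jitter: sleep a random fraction of the delay so that many
            # clients do not all retry at the same instant.
            sleep(rng() * delay)
    # Final attempt: if this one also fails, the error propagates to the caller.
    return send_request()


def send_spaced(requests, min_interval=0.5, sleep=time.sleep, **retry_kwargs):
    """Run callables one at a time, pausing min_interval seconds between them."""
    results = []
    for index, send_request in enumerate(requests):
        if index > 0:
            sleep(min_interval)  # space requests out instead of sending a burst
        results.append(call_with_backoff(send_request, sleep=sleep, **retry_kwargs))
    return results
```

To use it, wrap each API call in a function or `lambda` that raises `OverloadedError` on a 503, then pass a single callable to `call_with_backoff` or a list of them to `send_spaced`. Processing the list sequentially acts as a simple queue; the `sleep` and `rng` parameters exist only so the timing can be replaced in tests.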