Hello,
I want to deploy my model, but I always get this error after roughly 20 minutes of “deployment”:
Endpoint encountered an error.
You can try restarting it using the “retry” button above. Check logs for more details.
[Server message] Endpoint failed to start
Scheduling failure: unable to schedule
And in the logs I get this error:
Error 502 while fetching logs for "mon-modele-bricks-hiv":
Has this ever happened to anyone?
Hi @Albaninho10, thank you for reporting! We’re investigating now.
Hi @Albaninho10, thank you for waiting! This error message is related to the availability of the GPU instance at the time of deployment - it can be resolved by selecting a different instance type or region, if possible.
We’ve added a clearer version of this error message to the roadmap, though there’s no ETA just yet. Please let us know if you have any feedback about Inference Endpoints - we’re all ears!
I also wanted to mention our Model Catalog, which has ready-to-deploy models that require no additional customization, with deployment verified by Hugging Face.
Let us know if you have other questions.
I’ve seen similar issues with deployment failures related to GPU availability. From what you’re describing, it seems like the GPU instance may not be available when the model tries to deploy, which leads to the scheduling failure (and the 502 when fetching logs). One possible solution is to select a different instance type or region during deployment, so that GPU resources are actually available at scheduling time. Also, double-check whether a region-specific resource limitation might be causing the issue.
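If you’re deploying programmatically rather than through the UI, the “try another instance/region” advice can be automated. Below is a minimal sketch of a fallback loop; `deploy_with_fallback`, `fake_deploy`, and the candidate list are hypothetical names for illustration - in practice the deploy callable would wrap something like `huggingface_hub.create_inference_endpoint` with your own parameters.

```python
def deploy_with_fallback(deploy, candidates):
    """Try each (region, instance_type) pair until one schedules.

    `deploy` is any callable taking region/instance_type keyword
    arguments that raises RuntimeError on a scheduling failure and
    returns an endpoint handle on success.
    """
    errors = []
    for region, instance_type in candidates:
        try:
            return deploy(region=region, instance_type=instance_type)
        except RuntimeError as err:  # e.g. "Scheduling failure: unable to schedule"
            errors.append((region, instance_type, str(err)))
    raise RuntimeError(f"All candidates failed: {errors}")


# Example with a stub deploy function standing in for the real API call:
def fake_deploy(region, instance_type):
    if region == "us-east-1":
        raise RuntimeError("Scheduling failure: unable to schedule")
    return f"endpoint-on-{region}"


endpoint = deploy_with_fallback(
    fake_deploy,
    [("us-east-1", "nvidia-a10g"), ("eu-west-1", "nvidia-a10g")],
)
print(endpoint)  # falls back to the second region
```

This just encodes the manual workaround from this thread; whether a given region/instance pair has capacity still depends on availability at the moment you deploy.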
Thanks for the reply - indeed, after changing the region and GPU, the model deploys correctly!