Scheduling failure: unable to schedule

Pimpcat-AU · June 27, 2025, 12:31am

I’ve seen similar issues with deployment failures related to GPU availability. From what you’re describing, it seems like the GPU instance may not be available when the model tries to deploy, which causes the 502 error. One possible solution is to try selecting a different instance type or region during deployment to ensure that there are available GPU resources at the time of deployment. Also, double check if there’s any region specific resource limitation that might be causing the issue.

Topic		Replies	Views
Unable to start inference endpoint: not enough hardware capacity Inference Endpoints on the Hub	6	1267	December 12, 2023
Endpoint failed to start. Scheduling failure: not enough hardware capacity Inference Endpoints on the Hub	1	472	April 15, 2024
Server message:Endpoint failed to start Inference Endpoints on the Hub	3	627	June 26, 2024
Endpoint Deployment Failed Inference Endpoints on the Hub	1	80	December 10, 2024
Cannot Setup Mixtral Models and Other Models on Inference Endpoints Inference Endpoints on the Hub	1	408	December 22, 2023

Scheduling failure: unable to schedule

Related topics