From Hugging Face Discord:
Tom Aarsen
Hello! I believe some of the inference endpoints currently have “Scale to zero” enabled temporarily, meaning they will shut down after a period of no usage. The first request after that will then be slow or fail, but subsequent ones will work. We're going to disable scale to zero again so that this is no longer an issue, apologies for the inconvenience. cc @ VB can you update the scale to zero for the big ST models that already had APIs?
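Until scale to zero is disabled, the behavior described above (first request slow or failing while the endpoint cold-starts, later requests succeeding) can be worked around client-side with a simple retry loop. A minimal sketch, assuming a generic `request_fn` callable that raises while the endpoint is still waking up; the function name and parameters are illustrative, not part of any Hugging Face API:

```python
import time


def call_with_retry(request_fn, max_retries=5, delay=2.0):
    """Retry a request while a scale-to-zero endpoint cold-starts.

    request_fn should raise an exception while the endpoint is still
    waking up (e.g. an HTTP 503); the first successful result is returned.
    """
    last_exc = None
    for attempt in range(max_retries):
        try:
            return request_fn()
        except Exception as exc:  # endpoint still scaling up
            last_exc = exc
            time.sleep(delay)
    raise RuntimeError(
        f"endpoint still unavailable after {max_retries} attempts"
    ) from last_exc
```

Once the endpoint is warm, the first attempt succeeds and the loop adds no overhead.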