Inference API down?

LoGic142 · June 3, 2024, 7:41am

While accessing the speechbrain/lang-id-voxlingua107-ecapa model via the Inference API, I am getting the following error:

(MaxRetryError('HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /api/models/speechbrain/lang-id-voxlingua107-ecapa (Caused by NameResolutionError("<urllib3.connection.HTTPSConnection object at 0x7f5306290cd0>: Failed to resolve 'huggingface.co' ([Errno -3] Temporary failure in name resolution)"))'), '(Request ID: d978f641-257c-45c4-b95b-c51865344dfe)')

Can someone provide more insight into this error? And how do we solve it?
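For anyone hitting the same trace: the innermost exception is a NameResolutionError, so the hostname lookup failed before any HTTP request was sent. A minimal sketch for checking whether the hostname resolves from the machine making the call, using only the standard library. The helper name `can_resolve` and its `resolver` parameter are my own, added for illustration; they are not part of any Hugging Face client.

```python
import socket


def can_resolve(host, port=443, resolver=socket.getaddrinfo):
    """Return True if `host` resolves to at least a lookup result, False on a DNS error.

    `resolver` is injectable (hypothetical parameter) so the check can be
    exercised without touching the network.
    """
    try:
        resolver(host, port)
        return True
    except socket.gaierror:
        # Raised for failures such as
        # "[Errno -3] Temporary failure in name resolution".
        return False
```

Calling `can_resolve("huggingface.co")` from the affected environment gives a quick hint: `False` suggests the lookup is failing where the code runs, while `True` suggests looking further along the request path.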

RitchieP · June 3, 2024, 8:28am

I’m also facing issues with Inference API all of a sudden

votepurchase · June 3, 2024, 10:03am

same here.

jormeijer · June 3, 2024, 10:23am

We are also facing issues with the Inference Endpoints

John6666 · June 3, 2024, 10:29am

Same for hours.

nielsr · June 3, 2024, 11:06am

Thanks for reporting, issue should be fixed now.

John6666 · June 3, 2024, 11:09am

Fixed. Thank you.

pd-t · June 3, 2024, 2:33pm

@nielsr Does fixed mean that it is now a 500 internal server error? I am currently facing this error with all 3 providers and multiple models.

pd-t · June 4, 2024, 8:14am

@nielsr I have debugged the problem further. The endpoints work as long as they are public, both with and without scale to zero. If I secure the endpoint and request it without a token, a 401 is returned. So far so good. But if I pass a valid token, I get a 500. Do your integration tests work?
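The with-token and without-token comparison described above can be reproduced with a small probe. This is a sketch under assumptions: the endpoint URL is whatever your own endpoint exposes, the `{"inputs": "ping"}` payload is a placeholder that may not suit every model, and `probe` and its `opener` parameter are hypothetical names for illustration. The token is sent as an `Authorization: Bearer ...` header.

```python
import json
import urllib.error
import urllib.request


def probe(url, token=None, opener=urllib.request.urlopen, timeout=10):
    """Send one POST request and return the HTTP status code.

    `opener` is injectable (hypothetical parameter) so the function can be
    tested without a live endpoint.
    """
    headers = {"Content-Type": "application/json"}
    if token:
        headers["Authorization"] = f"Bearer {token}"
    # Placeholder payload; adjust to whatever the deployed model expects.
    body = json.dumps({"inputs": "ping"}).encode("utf-8")
    req = urllib.request.Request(url, data=body, headers=headers, method="POST")
    try:
        with opener(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # 4xx and 5xx responses arrive as exceptions; report their code.
        return err.code
```

Running `probe(url)` and `probe(url, token=...)` back to back against a protected endpoint shows both outcomes side by side: a 401 without a token is expected, while a 500 with a valid token points at the server side.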

pd-t · June 4, 2024, 11:43am

@nielsr Ok. This is really weird now. For 2 hours I got a 401 from the UI when creating new endpoints, deleting existing ones (which cost me $12), or even showing existing instances. Now the instance is visible again, and the endpoint is working with a token. So I have one last question: Are you fixing things in production without customer feedback, and what kind of availability and stability can I expect from dedicated endpoints? Are they ready for production (>99.9% availability)?

nielsr · June 4, 2024, 6:10pm

Hi,

Yes, they should be ready for production (they aim to make putting ML models in production easier with a few clicks). I appreciate your feedback. I’m not part of the Inference Endpoints team, but I will forward your feedback to them.

rjurney · June 8, 2024, 12:12am

The APIs for the evaluate library have also been down for five days.

rjurney · June 8, 2024, 12:12am

Just go here and see the runtime errors: evaluate-metric (Evaluate Metric)
