For meta-llama and mistral text generation LLM using the InferenceClient(), I’m getting: Bad Request: The endpoint is paused, ask a maintainer to restart it
Is something not working at HF?
Is it explicitly PAUSED? Maybe it’s under maintenance or something… @michellehbn @meganariley
Hi @gtvracer, thanks for reaching out and for being PRO! On the model page, you can request provider support for that model if it is not currently deployed and available for use through Inference Providers.
As an example, meta-llama/Llama-3.1-8B-Instruct is currently available through providers like Featherless AI, Nscale, SambaNova, Fireworks, Hyperbolic, etc. You can also deploy models with Inference Endpoints (dedicated).
To see which models are available to use with HF Inference, check out our filtered search here: Models - Hugging Face
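If you prefer to check this from code rather than the web UI, here is a minimal sketch that asks the public Hub API which providers currently serve a given model. The expand[] query parameter and the exact response shape are assumptions based on what the model pages display, so treat it as illustrative rather than a documented contract:

```python
import requests

# Hypothetical availability check: ask the Hub API which Inference
# Providers currently serve a model. The expand[] parameter and the
# response shape are assumptions, not a documented contract.
MODEL_ID = "meta-llama/Llama-3.1-8B-Instruct"

resp = requests.get(
    f"https://huggingface.co/api/models/{MODEL_ID}",
    params={"expand[]": "inferenceProviderMapping"},
    timeout=10,
)
resp.raise_for_status()
mapping = resp.json().get("inferenceProviderMapping") or {}

if mapping:
    print(f"{MODEL_ID} is served by: {', '.join(mapping)}")
else:
    print(f"No Inference Provider is serving {MODEL_ID} right now.")
```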
Ok. Text generation models are no longer available through HF Inference API: Models - Hugging Face
Is this intended?
This would be a massive bummer haha
Exception: 504 Server Error: Gateway Time-out for url: https://api-inference.huggingface.co/models/meta-llama/Llama-3.3-70B-Instruct/v1/chat/completions
Currently, it appears that no text generation models are deployed on HF Inference, though some are available through other Inference Providers…
How can HF suddenly withdraw support for InferenceClient calls that worked yesterday, without any warning? Very unprofessional and very disappointing…
Make sure your huggingface-hub is updated to 0.33.2 so it accepts the provider parameter. I used "nebius" for meta-llama models and it worked.
Use provider="together" for mistralai models.
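Putting those two posts together, here is a minimal sketch of the fix. The provider names and model ID are just the examples from this thread, and the api_key value is a placeholder for your own HF token:

```python
from huggingface_hub import InferenceClient

# Route requests through an Inference Provider instead of the retired
# HF Inference text-generation deployment. Requires huggingface_hub >= 0.33.2.
client = InferenceClient(
    provider="nebius",   # "together" reportedly works for mistralai models
    api_key="hf_...",    # placeholder: use your own Hugging Face token
)

response = client.chat_completion(
    model="meta-llama/Llama-3.1-8B-Instruct",
    messages=[{"role": "user", "content": "Hello!"}],
    max_tokens=64,
)
print(response.choices[0].message.content)
```

With provider set, the client sends the request to that provider's endpoint instead of api-inference.huggingface.co, which is why the paused-endpoint and 504 errors above go away.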
Almost-official news from HF DevRel on Discord:
yes! these models were sunset as part of us closing down hugging.chat unfortunately
however there’s quite a lot of models that you can use through our inference providers as a replacement