Subject: Hosted Inference returning 404 for multiple models (need assistance)

Hi Hugging Face Support,

I can access Hub metadata (whoami and model_info succeed), but hosted inference calls from my environment return 404 for multiple models.

Details:

  • HF username: Hirtheesh
  • Environment: Windows, venv at C:\study\echoverse\venv
  • huggingface-hub version: (my local version)
  • Models tested and results:
    • google/flan-t5-large → 404 Not Found (x-request-id: Root=1-68ca4f88-087b50c61af9d0812349d41b)
    • sshleifer/tiny-gpt2 → 404 Not Found (tested just now)
  • Token: I verified my token is valid (whoami works). The token is set in the process environment for these tests.
  • Raw request diagnostics: a POST to the model's hosted inference URL returns 404 with headers including x-inference-provider: hf-inference and X-Cache: Error from cloudfront (see the sketch below for how I'm capturing these).
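
For reference, here is roughly how I am reproducing the failure (a minimal sketch; the api-inference.huggingface.co URL and the HF_TOKEN variable name are my assumptions about the classic hosted endpoint and how my token is set):

```python
import os
import requests

# Assumed classic hosted inference endpoint; model id is one of the failing ones
MODEL = "google/flan-t5-large"
URL = f"https://api-inference.huggingface.co/models/{MODEL}"
headers = {"Authorization": f"Bearer {os.environ['HF_TOKEN']}"}  # token from my environment

resp = requests.post(URL, headers=headers, json={"inputs": "Hello"})
print(resp.status_code)  # 404 in my case

# Headers worth forwarding to support
for key in ("x-request-id", "x-inference-provider", "X-Cache"):
    print(key, "->", resp.headers.get(key))
```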

Could you confirm whether hosted inference is enabled for my account/region, and whether these models are available for hosted inference? If you need additional request IDs or headers, let me know what to capture and I'll provide them.

Thanks,
Hirtheesh

The Inference API has been revamped into Inference Providers, and the set of deployed models has changed significantly. You can check which models are currently deployed on the Hub; flan-t5-large does not appear to be deployed and is therefore unavailable.
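
If you want to check a specific model's deployment status programmatically, something like this should work (a minimal sketch, assuming a huggingface_hub version recent enough that model_info supports the expand parameter):

```python
from huggingface_hub import model_info

# "inference" is an expandable property reporting deployment status
info = model_info("google/flan-t5-large", expand=["inference"])
print(info.inference)  # e.g. "warm" when deployed, "cold" or None otherwise
```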

What model can I use instead of it?

It depends on the use case and your budget. The free tier only covers up to $0.01 worth of inference per month…
There seem to be several T5 models available.
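
For example, to search for T5-family models that the hf-inference provider currently serves (a sketch, assuming a huggingface_hub version where list_models accepts the inference_provider filter):

```python
from huggingface_hub import HfApi

api = HfApi()
# T5-family models currently served by the hf-inference provider
for m in api.list_models(search="t5", inference_provider="hf-inference", limit=10):
    print(m.id)
```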