Persistent 404 Not Found Errors with Public Inference API

Problem:
For the past day or so, all my attempts to make POST requests to the public Inference API endpoint have resulted in a 404 Not Found error. This happens regardless of the model I try to query, including standard, known-available models like gpt2. The response body simply contains "Not Found".

My Hugging Face Username: Mehdimemar

Troubleshooting Steps Taken:

  1. Model Validity Confirmed: I’ve tested numerous valid model IDs (like gpt2, distilbert-base-uncased-finetuned-sst-2-english, and various segmentation models). The 404 error occurs consistently.

  2. Access Token Verified: I have generated multiple new User Access Tokens from my account settings with the read role. I’ve carefully copied them and ensured they are correctly formatted in the Authorization: Bearer YOUR_HF_TOKEN header. I tried write tokens as well, with the same result.

  3. Network Connectivity Verified: nslookup, ping, and tracert to api-inference.huggingface.co are all successful from my testing environment. General internet connectivity is working fine (tested against httpbin.org).

  4. Direct curl Test (Outside Other Platforms): To isolate the issue, I performed direct tests using curl from my local machine. These tests also result in the same 404 Not Found error. Example command available upon request; a rough Python equivalent is sketched after this list.

  5. Checked Hugging Face Status Page: The status page (status.huggingface.co) indicates services are operational, though HF Inference shows some past instability. The persistent 404 error doesn’t seem like temporary service unavailability (which usually returns a 503).

  6. Checked Account Settings: I’ve reviewed my account settings (Tokens, Billing [though not required for the public API], etc.) via huggingface.co/settings/tokens and haven’t found any obvious issues, restrictions, or required actions. My email is verified.
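For reference, here is a rough Python equivalent of my curl test (using the requests library; YOUR_HF_TOKEN is a placeholder, and the whoami-v2 call is just an extra sanity check that the token itself is valid, separate from inference):

import requests

HF_TOKEN = "YOUR_HF_TOKEN"  # placeholder, not a real token
headers = {"Authorization": f"Bearer {HF_TOKEN}"}

# Sanity check the token itself, independent of inference.
# (whoami-v2 is the endpoint the huggingface_hub library uses for login checks.)
who = requests.get("https://huggingface.co/api/whoami-v2", headers=headers)
print("whoami:", who.status_code)

# Reproduce the failing call against the public serverless endpoint.
resp = requests.post(
    "https://api-inference.huggingface.co/models/gpt2",
    headers=headers,
    json={"inputs": "Hello, world"},
)
print("inference:", resp.status_code, resp.text)  # consistently 404 Not Found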


Conclusion / Question:

Given that network connectivity is fine, valid models are being used, and valid tokens (with the correct permissions) appear to be sent correctly (verified with curl -v), the issue strongly suggests a problem with token validation specific to my account (Mehdimemar), or some unknown restriction or status issue with my account preventing Inference API access.

Has anyone else experienced similar persistent 404 errors recently? Is there anything specific I should double-check, or could this require investigation by the Hugging Face team?

1 Like

I think pretty much all users are in that state…

1 Like

I’m new to Hugging Face, and experienced this issue too…

1 Like

I am also experiencing the same issue, and I have checked with many others too. This issue has been occurring for a few days. I think it’s partly a bug in their system and partly because they are shifting to other inference providers.

1 Like

I am also facing this same issue.

1 Like

Me too. I need to do a huge training task for an application, and I paid for the pro tier, plus already paid a good chunk extra for the first part of the training task. Really frustrating that I’m now stuck, unable to complete the task even though I forked out a bunch. Is there no way to get an official reply? I haven’t seen any way of contacting support, if it exists. The official status page says all systems go.

2 Likes

For libraries, it is best to contact the developer via GitHub.

There are several ways to contact support for general issues with the Hub.

website@huggingface.co

Hi all, thanks for reporting! You can check to see if your model is available to use with the HF Inference API (or any Inference Provider) here: Models - Hugging Face. If it’s not deployed by any Inference Provider, you can request provider support on the model page.

Please note Inference Endpoints is available to use - more info here: Inference Endpoints.
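For example, once a model shows a live provider there, you can query it with huggingface_hub’s InferenceClient (a sketch, assuming a recent huggingface_hub release, since the provider argument is relatively new, and a placeholder token):

from huggingface_hub import InferenceClient

# "auto" selects the first available Inference Provider for the model.
client = InferenceClient(provider="auto", api_key="YOUR_HF_TOKEN")  # placeholder token

completion = client.chat_completion(
    model="meta-llama/Llama-3.3-70B-Instruct",
    messages=[{"role": "user", "content": "Say hello"}],
    max_tokens=50,
)
print(completion.choices[0].message.content)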

Thanks!

1 Like

Yes, me too. I faced the issue with some models, but the following two models work fine:

  • model: "mistralai/Mixtral-8x7B-Instruct-v0.1"
  • model: "meta-llama/Llama-3.3-70B-Instruct"

Note that I have only checked a few.

1 Like

Hi there! Has this been resolved?

1 Like

As mentioned above, if the model is currently deployed, it should be available via the Inference Provider.

Please note that the program, or rather the Endpoint URL, has changed slightly.
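For instance, a raw HTTP call now goes through the router’s OpenAI-compatible route (a sketch; YOUR_HF_TOKEN is a placeholder, and the model must be deployed by some provider):

import requests

# Sketch of the newer router endpoint (OpenAI-compatible chat completions).
resp = requests.post(
    "https://router.huggingface.co/v1/chat/completions",
    headers={"Authorization": "Bearer YOUR_HF_TOKEN"},  # placeholder token
    json={
        "model": "Qwen/Qwen2.5-Coder-32B-Instruct",
        "messages": [{"role": "user", "content": "Hello"}],
        "max_tokens": 100,
    },
)
print(resp.status_code, resp.json())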

I’m using the "Qwen/Qwen2.5-Coder-32B-Instruct" model and was getting the same/similar error. I modified the initialization a little bit, adding max_tokens, provider, etc., and it worked for me. Here’s how I initialize it:

# Requires the llama-index-llms-huggingface-api package for this class.
from llama_index.llms.huggingface_api import HuggingFaceInferenceAPI

llm = HuggingFaceInferenceAPI(
    model_name="Qwen/Qwen2.5-Coder-32B-Instruct",
    temperature=0.7,
    max_tokens=100,
    provider="auto",
)
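With that in place, a simple completion call works as usual (llm.complete is the standard llama_index completion method; the prompt here is just an example):

response = llm.complete("Write a haiku about debugging.")
print(response.text)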

In my case, I think the issue was related to max_tokens, because I was almost running out of free credits. That can be really confusing, because the error doesn’t say it’s a payment issue unless you’ve completely used up your free credits.

Sometimes it could also be because you’ve run out of free credits, so take a close look at the error message.

Try assigning a lower max_tokens and see how it works.

1 Like