Serverless Inference API always returns 404, even for public models

Hi everyone,

I’m trying to use the Hugging Face Serverless Inference API, but all calls return HTTP 404 — even for small public models like distilgpt2.
- Endpoint: https://api-inference.huggingface.co/models/distilgpt2
- Token: fine-grained token with **Read** and **Make calls to Inference Providers** enabled
- Tested on both home Wi-Fi and mobile hotspot (same result)
- Tried with PowerShell, Python requests, and Invoke-RestMethod — always 404
Works fine when running the model locally via transformers

Does anyone know why this might be happening or if something is wrong with my account setup?


The API specifications, including the endpoint URL, have changed significantly, so if you are calling the URL directly you will need to rebuild your requests based on the new documentation. The old `https://api-inference.huggingface.co/...` path no longer resolves, which is why you get a 404 regardless of token, model, or network. If you are calling the API through a library such as `huggingface_hub`, updating the library to the latest version is recommended, since recent versions target the new endpoints for you.
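As a sketch of what a direct call looks like now: the snippet below assumes the new router-style endpoint (`https://router.huggingface.co/hf-inference/models/<model>`), which replaced the old `api-inference.huggingface.co` path. The exact path can vary by provider and task, so please verify it against the current Inference Providers docs before relying on it. The network call only fires when an `HF_TOKEN` environment variable is set.

```python
import json
import os
from urllib import request

# Assumed new router base URL (the old api-inference.huggingface.co path was
# retired); confirm the current path in the Inference Providers documentation.
ROUTER_URL = "https://router.huggingface.co/hf-inference/models/{model}"


def build_request(model: str, prompt: str, token: str):
    """Build the URL, headers, and JSON payload for a text-generation call."""
    url = ROUTER_URL.format(model=model)
    headers = {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
    payload = {"inputs": prompt}
    return url, headers, payload


if __name__ == "__main__":
    token = os.environ.get("HF_TOKEN", "")
    url, headers, payload = build_request("distilgpt2", "Hello, world", token)
    if token:
        # Send the POST request only when a real token is available.
        req = request.Request(
            url,
            data=json.dumps(payload).encode(),
            headers=headers,
            method="POST",
        )
        with request.urlopen(req, timeout=30) as resp:
            print(resp.status, resp.read().decode())
    else:
        print("Set HF_TOKEN to send the request; URL would be:", url)
```

If you would rather not track URL changes by hand, the `huggingface_hub` client handles endpoint routing internally, which is the main reason the reply above suggests upgrading the library instead of hard-coding URLs.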