Gateway Problem

JoseLuisNeves · December 20, 2024, 11:22am

I bought huggingface pro. I am trying to use the model microsoft/Phi-3-mini-128k-instruct on HuggingFace’s Serverless API since yesterday through the following code:

response = self.client.chat.completions.create(
model=self.model_name,
messages=[{“role”: “user”, “content”: prompt}],
temperature=0,
max_tokens=300
)
However, I keep having the same errors:

and
504 Server Error: Gateway Time-out for url: https://api-inference.huggingface.co/models/microsoft/Phi-3-mini-128k-instruct/v1/chat/completions

I tried other models such as meta-llama/Llama-3.2-1B and I keep having the same problem. In the midst of the errors, there were like one or two sucessful requests. Is this normal?

gregstone · January 5, 2025, 7:07pm

I am getting the same thing.

meganariley · January 7, 2025, 6:55pm

Hi @JoseLuisNeves The model microsoft/Phi-3-mini-128k-instruct is not loaded on the serverless API, but you can use this model with Inference Endpoints. Inference Endpoints allows you to easily deploy your models on dedicated, fully-managed infrastructure, and will give you the flexibility to quickly create endpoints on CPU or GPU resources. It’s billed by compute uptime vs character usage.

Topic		Replies	Views
Phi-3-mini-128k-instruct not working with pro inference api Inference Endpoints on the Hub	14	2253	August 26, 2024
Getting "502 Server Error: Bad Gateway for url: https://api-inference.huggingface.co/models/meta-llama/Llama-3.2-3B-Instruct" error 🤗Hub	8	215	April 28, 2025
Serverless inference issues for a new Go library Inference Endpoints on the Hub	4	30	March 18, 2025
Request to Serverless Inference API failed with 400 status code Inference Endpoints on the Hub	2	227	March 4, 2025
Are all the documents in hugging face comes with 504 bad gateway err? Beginners	2	1377	November 28, 2022

Gateway Problem

Related topics