Constant 503 error for several days when running LLAMA 3.1

Hi all. Earlier this month I wrote a script incorporating LLAMA 3.1, and it worked well (with timeouts, etc), but for the past week whenever I try to access LLAMA 3.1 at all I get a 503 error (including for short simple requests, like HF’s sample code).

Any idea what the problem may be? I do indeed have Pro.


Did you find a solution? I have been experiencing the same problem on and off for a couple of days now.


There were at least three reports of the same situation on Discord, so I think it’s probably a server-side issue. @michellehbn

Am still getting a 503 when running Llama-3.2-3B-Instruct. Any known solution?
huggingface_hub.errors.HfHubHTTPError: 503 Server Error: Service Temporarily Unavailable for url: https://router.huggingface.co/hf-inference/models/meta-llama/Llama-3.2-3B-Instruct


Depending on the model, it seems this can be avoided by explicitly specifying an inference provider when creating the client.


Thanks. Explicitly setting a provider works like magic. Here is the documentation: HF Inference
Sample

import os
from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="hf-inference",
    token=os.environ["HF_TOKEN"],
)
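Since the 503s reported in this thread come and go, it can also help to retry transient failures instead of giving up on the first error. Below is a minimal sketch of a retry helper; the function name, retry count, and backoff values are illustrative, not part of the `huggingface_hub` API:

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")

def retry_on_503(call: Callable[[], T], retries: int = 3, backoff: float = 2.0) -> T:
    """Retry a zero-argument callable when it raises an error mentioning a 503 status."""
    for attempt in range(retries):
        try:
            return call()
        except Exception as exc:
            # Re-raise immediately if it is not a 503 or we are out of attempts.
            if "503" not in str(exc) or attempt == retries - 1:
                raise
            time.sleep(backoff * (attempt + 1))  # linear backoff between attempts
    raise RuntimeError("unreachable")
```

You could then wrap the actual request, e.g. `retry_on_503(lambda: client.chat_completion(messages=msgs, model="meta-llama/Llama-3.2-3B-Instruct"))`, so a brief service hiccup does not crash the script.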