LLAMA2 70b Inference api stuck on currently loading

prapti19 · November 14, 2023, 4:10am

Hi,
Up until till morning, I was using the inference APIs for llama-2-70b-chat-hf model , and now I only get the following error repetatedly:
{'error': 'Model meta-llama/Llama-2-70b-chat-hf is currently loading', 'estimated_time': 5518.13232421875}
The estimated time does not change as this error keeps on coming. I even tried periodically with a gap of few hours throughout the day, but still with no progress. When I run llama2-7b and llama2-17b its working fine, but for my research project I have to use 70b llama necessarily. Is anyone else facing this problem. Any help will be highly appreciated.

Thanks

P.S. I have bought the Pro membership of HF

eboraks · November 14, 2023, 10:39am

I have the same issue.

eboraks · November 14, 2023, 12:41pm

@prapti19 the service is back

prapti19 · November 14, 2023, 2:22pm

Thanks for the update @eboraks !

CraigJr · September 3, 2024, 1:39am

I am also having this issue, It has been persisting for at least 4 days. What do you think could be going wrong?

Topic		Replies	Views
Llama 2 Inference Endpoint Stop Working Inference Endpoints on the Hub	2	356	June 25, 2024
API access no longer working despite Pro subscription 🤗Hub	6	774	April 12, 2024
I want to know if I can use llama 2 7b for my project with hugging face pro subscription 9 $ only? Beginners	0	501	December 13, 2023
Meta-llama / Meta-Llama-3-70B-Instruct is not available as a serverless API Models	10	1617	September 28, 2024
Help using inference endpoint with Llama 3.1 405B Instruct Inference Endpoints on the Hub	1	167	August 30, 2024

LLAMA2 70b Inference api stuck on currently loading

Related topics