API access no longer working despite Pro subscription

It seems as if I no longer have access to the Llama 2-70b model through the API.

I have an HF Pro subscription and had been running code fine up until about 20 minutes ago. I stopped my script from sending any more queries so I could amend a parameter, then sent it out again, only to receive the following error as a response:
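For context, the kind of call that was working is a standard Inference API text-generation request. This is a minimal sketch, not my exact script; the token value is a placeholder, and `max_new_tokens` is just an illustrative parameter:

```python
# Minimal sketch of a Hugging Face Inference API text-generation request.
# The token here is a placeholder; substitute your own HF access token.

API_URL = "https://api-inference.huggingface.co/models/meta-llama/Llama-2-70b-chat-hf"

def build_request(token: str, prompt: str, max_new_tokens: int = 100):
    """Return the URL, headers, and JSON payload for an Inference API call."""
    headers = {"Authorization": f"Bearer {token}"}
    payload = {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}}
    return API_URL, headers, payload

# To actually send it (requires the `requests` library and a valid token):
# import requests
# url, headers, payload = build_request("hf_xxx", "Hello, Llama!")
# print(requests.post(url, headers=headers, json=payload).json())
```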

{'error': 'The model meta-llama/Llama-2-70b-chat-hf is too large to be loaded automatically (137GB > 10GB). Please use Spaces (Spaces - Hugging Face) or Inference Endpoints (Inference Endpoints - Hugging Face).'}

I created a new API token, but the issue persists.

To reiterate, I have an active and valid Pro subscription. If I can’t use the API, what am I paying for?


I am having the exact same issue.

Edited to add:
They may have taken down Llama 2 70b from the Inference API, based on other forum answers I've read. Notably, c4ai-command-r-plus, which is larger than Llama 2 70b, is still working.

I’m having the same problem.
I contacted them and they answered me that they have “temporarily removed meta-llama/Llama-2-70b-chat-hf but it will be back to use with the Inference API soon, though no ETA just yet.”
So we just need to wait.

(Also, I think that this holds for any other Llama2 version since they are all unavailable at the moment)

Hi Karl,

Thanks for the response; it does seem that Llama 2 is no longer supported. Would you mind sharing your code for the API query to the Cohere model?

For some reason my code is failing to find the correct tokenizer…
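Not Karl's actual code, but one way to query c4ai-command-r-plus without resolving a tokenizer locally is to post raw text to the Inference API, which tokenizes server-side. A hedged sketch, assuming the standard Inference API endpoint and a placeholder token:

```python
# Hypothetical sketch: querying CohereForAI/c4ai-command-r-plus via the raw
# Inference API. Tokenization happens server-side, so no local tokenizer
# lookup is needed.

MODEL_ID = "CohereForAI/c4ai-command-r-plus"
API_URL = f"https://api-inference.huggingface.co/models/{MODEL_ID}"

def build_query(token: str, prompt: str, max_new_tokens: int = 200):
    """Return the headers and payload for a server-side-tokenized generation call."""
    headers = {"Authorization": f"Bearer {token}"}
    payload = {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}}
    return headers, payload

# To send (needs the `requests` library and a real token):
# import requests
# headers, payload = build_query("hf_xxx", "Write a haiku about GPUs.")
# print(requests.post(API_URL, headers=headers, json=payload).json())
```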


I guess it's working now. I had the same issue yesterday, but now it seems to work fine.


Sorry, folks, meta-llama/Llama-2-70b-chat-hf is back online after temporarily going down.

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.