Inference Llama-2-13b not working

Tanuj · July 19, 2023, 4:44pm

Following the blog, I was able to run inference on my gpu server for meta-llama/Llama-2-7b-chat-hf as has been done in the tutorial. (Have been granted access to Llama-2 already)

Now I wish to run inference on meta-llama/Llama-2-13b. I keep getting the error:
“OSError: meta-llama/Llama-2-13b does not appear to have a file named config.json. Checkout ‘https://huggingface.co/meta-llama/Llama-2-13b/main’ for available files.”

Checking the mentioned repo for config.json, I see there actually isn’t one for meta-llama/Llama-2-13b (and most other Llama2 models except meta-llama/Llama-2-7b-chat-hf).

Could the missing file(s) it be added please? Alternatively, if there’s another way to do a run the model privately, it would be great. Thanks!

0xmaddie · July 19, 2023, 7:16pm

I was able to load llama-2-13b-chat-hf with this Google Colab notebook, but inference failed for me as well due to a RuntimeError. I’m not sure if it’s entirely related to your problem, but maybe you’d like to take a look?

Tanuj · July 20, 2023, 4:13pm

Got the model working. Just need to use the models with ‘hf’ in the name.

krishnagarg09 · July 21, 2023, 9:36pm

Just to make it more clear, this runs perfectly fine:

meta-llama/Llama-2-13b-hf

flaviobrio · July 30, 2023, 8:47pm

But it says that PRO license is required to use hf ones

Topic		Replies	Views
LLAMA-2 Download issues Models	8	7864	November 7, 2023
Meta-llama / Meta-Llama-3-70B-Instruct is not available as a serverless API Models	10	1588	September 28, 2024
LLAMA2 70b Inference api stuck on currently loading Inference Endpoints on the Hub	4	1036	September 3, 2024
Does llama-2 need pro subscription? Beginners	6	6407	November 24, 2023
Inference Issue with Llama Models using HF Inference Beginners	1	30	February 6, 2025

Inference Llama-2-13b not working

Related topics