Llama-2 on colab

suhaaspk · August 8, 2023, 7:25pm

Hello!

I am trying to download llama-2 for text generation on google colab free version. I tried simply the following

model_name = "meta-llama/Llama-2-7b-chat-hf"
tokenizer = AutoTokenizer.from_pretrained(model_name, token=True)
model = AutoModelForCausalLM.from_pretrained(model_name, token=True)

But this gives me an “ran out of RAM” error and the runtime crashes. I noticed that the GPU RAM wasn’t being used and the CPU RAM was going past the limit and causing the runtime to crash. I saw some potential solutions of trying to checkpoint online – I haven’t done this before so I have to learn how but will learn if that is useful. Are there any ways to successfully get this model running on colab. Additionally, as a more general question – How can I predict how much memory it takes to run a specific model?

Any advice is much appreciated. Thank you!

obi77 · August 8, 2023, 8:47pm

I think I am having the same issue Colab RAM Limit Exceeded: Unable to Run 3B Model Even with Quantization I will share a solution if I find it

philippetatel1 · August 9, 2023, 10:10pm

by any chance you found something

Amansoni · November 28, 2023, 4:50am

You can use llama 2 in colab using 4 bit quantization this shorten the memory usage but this will not work without GPU below is the link:

To use the model below is the main code:

if torch.cuda.is_available():
torch.set_default_device(‘cuda’)

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-chat-hf",
                                            torch_dtype="auto", load_in_4bit=True)

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-chat-hf",
                                          torch_dtype="auto")
tokenizer.use_default_system_prompt = False

Topic		Replies	Views
Colab RAM Limit Exceeded: Unable to Run 3B Model Even with Quantization Beginners	0	1059	August 8, 2023
Colab CUDA OOM using Llama-2-7b-chat-hf even with 40GPU RAM 🤗Transformers	0	906	December 29, 2023
Can't load fine tuned LLamav2 7b Beginners	2	1112	October 13, 2023
How to get Llama-2-13b-chat-hf to ACTUALLY RUN Beginners	0	254	May 30, 2024
Colab RAM crash error - Fine-tuning RoBERTa in Colab Beginners	3	6502	December 15, 2020

Llama-2 on colab

Related topics