How much memory would 13B take? 13B params × 4 bytes (fp32) ≈ 52 GB for the weights alone?
We are getting a CUDA OOM error while fine-tuning a 13B Llama model on a 4xA100 cluster. What might we be doing wrong?
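The 13 × 4 arithmetic above covers only fp32 weights. Full fine-tuning with Adam also holds gradients plus two optimizer moments per parameter, which is likely why 4xA100 runs out of memory without sharding. A minimal back-of-the-envelope sketch (assuming fp32 throughout and ignoring activations, which add more on top):

```python
PARAMS = 13e9  # 13B-parameter model

def finetune_gb(params: float, bytes_per_param: float) -> float:
    """Estimate memory in GB (1 GB = 1e9 bytes) for a given per-param cost."""
    return params * bytes_per_param / 1e9

# fp32 weights only: 4 bytes/param
weights_only = finetune_gb(PARAMS, 4)
# full Adam fine-tune: 4 (weights) + 4 (grads) + 8 (Adam m and v) = 16 bytes/param
full_adam = finetune_gb(PARAMS, 16)

print(f"fp32 weights alone:        {weights_only:.0f} GB")  # ~52 GB
print(f"fp32 fine-tune with Adam:  {full_adam:.0f} GB")     # ~208 GB
```

Under these assumptions, training state alone is roughly 208 GB before activations, so replicating it on each of four 80 GB A100s overflows a single GPU; techniques like ZeRO/FSDP sharding, mixed precision, or parameter-efficient methods (e.g. LoRA) are the usual ways to bring the per-GPU footprint down.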