Hi, I wanted to play with the LLaMA 7B model recently released. With the command below I got OOM error on a T4 16GB GPU. How much GPU do I need to run the 7B model? In the Meta FAIR version of the model, we can adjust the max batch size to make it work on a single T4. What should be done here to ma…

LLaMA 7B GPU Memory Requirement

sgugger March 21, 2023, 8:34pm 2

To run the 7B model in full precision, you need 7 * 4 = 28GB of GPU RAM. You should add torch_dtype=torch.float16 to use half the memory and fit the model on a T4.

16 Likes

Topic		Replies	Views
Memory requierements Models	2	336	February 18, 2025
Hardware Requirement GPU Beginners	3	1061	January 27, 2025
Llama 3.1 8b Instruct - Memory Usage More than Reported Models	5	403	February 18, 2025
LLaMA2 7B uses > 128 GB of GPU Ram and fails with OOM or Loss Scale Minimum 🤗Transformers	3	5552	August 17, 2023
Llama 3.1 70-B run on 32 GB Vram? 🤗Transformers	5	3688	September 20, 2024

LLaMA 7B GPU Memory Requirement

Related topics