LLaMA 7B GPU Memory Requirement

Forbu14 · June 24, 2023, 7:31am

Basicly the idea is that you store the row weights (weigths are store in 16bit parameters format) and you also need to store the gradient of the weights. As 1 bytes = 8 bits, you need 2B for every weights and another 2B for the gradient. And that’s only the case if you use SGD optimization because if you use ADAM as your optimizer, you need more memory per weights.
So you ends up with a raw memory requirement of 4*nb_parameters if you use SGD.

Topic		Replies	Views
Memory requierements Models	2	384	February 18, 2025
Hardware Requirement GPU Beginners	3	1151	January 27, 2025
Llama 3.1 8b Instruct - Memory Usage More than Reported Models	5	453	February 18, 2025
LLaMA2 7B uses > 128 GB of GPU Ram and fails with OOM or Loss Scale Minimum 🤗Transformers	3	5562	August 17, 2023
Llama 3.1 70-B run on 32 GB Vram? 🤗Transformers	5	3773	September 20, 2024

LLaMA 7B GPU Memory Requirement

Related topics