You need at least 12GB of GPU RAM to put the model on the GPU, and your GPU has less memory than that, so you won’t be able to use it on the GPU of this machine. You can’t use it in half precision on the CPU either, because not all layers of the model are implemented for half precision (the LayerNorm layer, for instance), so you need to use the model in full precision on the CPU to make predictions (that will take a looooooooong time).
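To see where a figure like 12GB comes from, you can estimate the footprint of the weights alone from the parameter count and the dtype size; the 6-billion-parameter count below is just an assumption for illustration:

```python
def model_memory_gb(n_params: int, bytes_per_param: int) -> float:
    """Rough size of the weights alone, in GB (1 GB = 1e9 bytes).

    Ignores activations, optimizer state, and framework overhead,
    so actual usage will be higher.
    """
    return n_params * bytes_per_param / 1e9

# Hypothetical 6-billion-parameter model:
n_params = 6_000_000_000
print(model_memory_gb(n_params, 2))  # fp16: 2 bytes/param -> 12.0
print(model_memory_gb(n_params, 4))  # fp32: 4 bytes/param -> 24.0
```

So a model of that size already fills 12GB in half precision, and running it in full precision on the CPU doubles that.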
As for the RAM footprint, we are working on a way for from_pretrained to only consume the model's size in RAM (currently it consumes twice the model size, since both the checkpoint state dict and the freshly initialized model are held in memory during loading). It should be merged soon.