I read here that it is possible to load GPT-J on a 12 GB card: Memory use of GPT-J-6B
However, I tried and could not, even with the recommended optimisations:
import torch
from transformers import GPTJForCausalLM

model = GPTJForCausalLM.from_pretrained("EleutherAI/gpt-j-6B", revision="float16", torch_dtype=torch.float16, low_cpu_mem_usage=True, cache_dir="/root/Desktop/models_cache/")
model.half()  # redundant here: torch_dtype=torch.float16 already loads the weights in fp16
model.to("cuda")
I get an out-of-memory error when I try to load the model onto the GPU.
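For reference, here is a minimal diagnostic sketch (assuming a recent PyTorch with torch.cuda.mem_get_info) that checks how much free memory the card reports before loading, next to a rough estimate of what the fp16 weights alone need. The 6e9 parameter count is an approximation of GPT-J-6B's size, not an exact figure:

import torch

# Free and total GPU memory in bytes, before anything is loaded.
free, total = torch.cuda.mem_get_info()
print(f"free: {free / 1024**3:.2f} GiB / total: {total / 1024**3:.2f} GiB")

# GPT-J-6B has roughly 6e9 parameters; at 2 bytes each in float16,
# the weights alone come to about 11-12 GiB, leaving very little
# headroom on a 12 GB card for the CUDA context and activations.
n_params = 6_000_000_000
print(f"approx. fp16 weights: {n_params * 2 / 1024**3:.2f} GiB")

If the free figure printed here is already noticeably below the weight estimate (the CUDA context by itself can take several hundred MB), that alone could explain the out-of-memory error.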