We can lower memory usage this way in PyTorch, but what about TensorFlow?
from transformers import GPTJForCausalLM
import torch

# Load the fp16 weights branch and avoid materializing a full fp32 copy in CPU RAM
model = GPTJForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B", revision="float16", torch_dtype=torch.float16, low_cpu_mem_usage=True
)
It seems that neither the `revision` nor the `low_cpu_mem_usage` parameter is available for the TF classes.
Is there any technique I can use to lower memory usage in TF? My RTX 3090 (24 GB) cannot load the GPT-J-6B model.