Generating 10000 sentences from GptNeo Model results in out of memory error

prb977 · March 23, 2022, 1:45am

I was doing some work where I wanted to generate 10000 sentences from the GPT-Neo model. I have a 40GB GPU and am running the model on it, but the code runs out of memory every time. Is there a limit on the number of sentences I can generate? Below is a small snippet of my code.

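from transformers import GPT2Tokenizer, GPTNeoForCausalLM

# model_name (checkpoint name), device, and sentence are defined elsewhere in the script
tokenizer = GPT2Tokenizer.from_pretrained(model_name)
model = GPTNeoForCausalLM.from_pretrained(model_name, pad_token_id=tokenizer.eos_token_id)
model.to(device)
# The prompt tensor has to live on the same device as the model
input_ids = tokenizer.encode(sentence, return_tensors='pt').to(device)
# Sample 10000 continuations of the prompt in a single generate() call
gen_tokens = model.generate(
    input_ids,
    do_sample=True,
    top_k=50,
    num_return_sequences=10000
)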
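
A possible workaround, sketched here rather than confirmed in this thread: as far as I understand, `num_return_sequences=10000` makes `generate()` expand the prompt into a batch of 10000 sequences that are all decoded at once, so memory grows with that number. Splitting the request into smaller calls and keeping only the decoded strings should keep the peak lower. The helper names `generate_in_chunks` and `make_sampler` and the chunk size of 100 are hypothetical choices for illustration, not part of the original code or of the transformers API.

```python
from typing import Callable, List


def generate_in_chunks(
    generate_fn: Callable[[int], List[str]], total: int, chunk_size: int
) -> List[str]:
    """Collect `total` samples by calling generate_fn(n) with n <= chunk_size."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    results: List[str] = []
    remaining = total
    while remaining > 0:
        n = min(chunk_size, remaining)
        results.extend(generate_fn(n))
        remaining -= n
    return results


def make_sampler(model, tokenizer, input_ids) -> Callable[[int], List[str]]:
    """Wrap the sampling call from the snippet above (hypothetical helper)."""

    def sample(n: int) -> List[str]:
        gen_tokens = model.generate(
            input_ids,
            do_sample=True,
            top_k=50,
            num_return_sequences=n,  # only n sequences are decoded per call
        )
        # Decoding to Python strings means no GPU tensors are kept between calls
        return tokenizer.batch_decode(gen_tokens, skip_special_tokens=True)

    return sample


# Usage with the objects from the snippet above (chunk size is an assumption to tune):
# sentences = generate_in_chunks(make_sampler(model, tokenizer, input_ids), 10000, 100)
```

The chunk size trades speed for memory: larger chunks use the GPU more efficiently but raise the peak, so it would need tuning for the model size and output length in use.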
