CUDA Memory Error While Trying to Run Bloom Locally

DanielFirpo · July 30, 2022, 8:24pm

Hey, so I’m trying to get Bloom running locally. I don’t do any AI coding or Python, so it’s tough, and I ran into a roadblock.

from transformers import AutoModelForCausalLM, AutoTokenizer, set_seed
import torch

torch.set_default_tensor_type(torch.cuda.FloatTensor)

print("downloading model")

model = AutoModelForCausalLM.from_pretrained("bigscience/bloom-1b3", use_cache=True)
tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom-1b3")


print("done")

set_seed(416136942)

model.__class__.__name__

prompt = 'Good morning...'

input_ids = tokenizer(prompt, return_tensors="pt").to(0)

sample = model.generate(**input_ids, max_length=50, top_k=0, temperature=0.9)

print(tokenizer.decode(sample[0], truncate_before_pattern=[r"\n\n^#", "^'''", "\n\n\n"]))

I get this error:

RuntimeError: CUDA out of memory. Tried to allocate 1.91 GiB (GPU 0; 8.00 GiB total capacity; 6.42 GiB already allocated; 194.69 MiB free; 6.42 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

When Googling the error, I see a lot of people saying to “decrease the batch size”. I don’t even know where Hugging Face downloads and stores Bloom, so I can’t find where to edit this batch size param, or where in the code I’d find it if I could find the code, or whether it would even solve the error.

I’m running a copy/paste of this code, which works in the Colab, so it’s an issue with my GPU: Google Colab
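A rough way to see why 8 GiB can be tight here is to estimate how much memory the weights alone take at different precisions. The sketch below is only back-of-the-envelope arithmetic: the parameter count of about 1.3 billion is an assumption inferred from the name "bloom-1b3", not a measured value, and the estimate ignores activations, the generation cache, and the CUDA context, which all add to the total.

```python
# Rough estimate of the GPU memory needed for the model weights alone.
# ASSUMPTION: ~1.3e9 parameters, inferred from the name "bloom-1b3", not measured.
ASSUMED_PARAMS = 1.3e9

# Bytes used to store one parameter in each precision.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_memory_gib(num_params: float, dtype: str) -> float:
    """Return the size of the weights alone, in GiB, for the given dtype."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: {weight_memory_gib(ASSUMED_PARAMS, dtype):.2f} GiB")
```

The script above sets the default tensor type to torch.cuda.FloatTensor, which is float32, the largest of these cases. Also, only one prompt is passed to the tokenizer, so the batch size is already 1 and there is nothing to decrease. One option that may help is loading the weights in half precision by passing torch_dtype=torch.float16 to from_pretrained and moving the model to the GPU with .to("cuda"); whether that is enough on this particular 8 GiB card is untested here.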

caroz · August 28, 2022, 3:12pm

Same here. It would be great to get some feedback on how to work around it.

bettyb · January 10, 2023, 8:51am

Same here. Did anyone find a solution?
