- I’m currently training Hugging Face Diffusers models for a 2D image-generation task, with images as input.
- Training runs on AWS G5 instances, i.e. A10G GPUs with 24 GB of GPU memory.
- I run into out-of-memory errors whenever I go beyond an image size of 256x256 with a batch size of 8.
- Results at image size 256 and batch size 8 are unacceptable.
- I already use gradient accumulation and mixed-precision training.
- The model uses only one attention block.
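Since gradient accumulation is already in use, it may help to confirm it is doing what we expect: averaging gradients over several small micro-batches gives the same parameter update as one large batch, while only the micro-batch's activations live in memory at once. A minimal pure-Python sketch with a toy linear model (illustrative only, not Diffusers code):

```python
# Toy gradient-accumulation check: averaging gradients over k micro-batches
# of size b matches a single pass over a batch of size k*b. This is why
# accumulation cuts activation memory without changing the update.

def grad(w, xs, ts):
    """Mean gradient of the MSE loss for the toy model y = w * x."""
    return sum(2 * x * (w * x - t) for x, t in zip(xs, ts)) / len(xs)

def accumulated_grad(w, xs, ts, accum_steps):
    """Split the batch into micro-batches and average their gradients."""
    n = len(xs) // accum_steps
    total = 0.0
    for i in range(accum_steps):
        chunk_x = xs[i * n:(i + 1) * n]
        chunk_t = ts[i * n:(i + 1) * n]
        # Each micro-batch gradient is scaled by 1/accum_steps before
        # summing, mirroring how training loops scale the loss.
        total += grad(w, chunk_x, chunk_t) / accum_steps
    return total

xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
ts = [2.0, 4.0, 6.0, 8.0, 10.0, 12.0, 14.0, 16.0]
full = grad(0.5, xs, ts)
accum = accumulated_grad(0.5, xs, ts, accum_steps=4)
```

Here `full` and `accum` agree (up to floating-point noise), so dropping the per-device batch from 8 to 2 with 4 accumulation steps keeps the effective batch at 8 while storing only a quarter of the activations.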
I’m trying to understand whether this is a genuine memory limit or something that can be worked around with other approaches.
Are diffusion models really so heavy that even a 24 GB GPU is insufficient?
What is the typical memory requirement for training image-to-image diffusion models at resolutions above 512x512?
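For a rough sense of scale: a UNet’s activation memory grows about linearly with pixel count, while a self-attention layer stores a tokens-by-tokens matrix, so it grows with the fourth power of resolution at a fixed downsampling factor. A back-of-envelope sketch (the 8x downsampling factor is an assumption for illustration, not a measured Diffusers value):

```python
def activation_scale(new_res, old_res):
    """UNet activation memory grows ~linearly with pixel count,
    so doubling the side length roughly quadruples it."""
    return (new_res / old_res) ** 2

def attention_scale(new_res, old_res, downsample=8):
    """Self-attention stores a (tokens x tokens) score matrix;
    tokens = (res // downsample)^2, so memory grows ~res^4."""
    def tokens(res):
        return (res // downsample) ** 2
    return (tokens(new_res) / tokens(old_res)) ** 2

act = activation_scale(512, 256)   # ~4x more activation memory
attn = attention_scale(512, 256)   # ~16x larger attention matrices
```

So a run that nearly fills 24 GB at 256x256 would need roughly 4x the activation memory at 512x512 before even counting attention, which is consistent with hitting OOM rather than a bug. Common mitigations are gradient checkpointing, attention slicing or memory-efficient attention (e.g. xFormers), 8-bit optimizers, and training in a latent space (as Stable Diffusion does) instead of pixel space.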
Thanks and regards