Memory usage by later pipeline stages

keturn · September 28, 2022, 1:22am

VRAM usage during my pipeline run looks like this:
[Chart: PyTorch Memory Use]

It uses a bunch of memory when I load the pipeline with from_pretrained, which is expected. Then more memory when I call the pipeline. Then it holds steady for a while as the pipeline is iterating through its steps, before making one smaller jump later.

I assume that last one is when the VAE is invoked to decode the output.

I didn’t number this chart, but the total range of the Y-axis is 8 GB, which makes that last jump something like half a gig.

While that seems like a lot for a measly 0.75 MB worth of pixel data, it’s not so much the amount I’m concerned with. It’s modest in comparison to the overall needs of the pipeline. My question is why the allocator grabs more memory at that time.
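For reference, the 0.75 MB figure can be reproduced with a little arithmetic. This is only a sketch: the 512x512 RGB output size and the 8-bit channel depth are assumptions (the post does not state them), and `image_bytes` is a hypothetical helper name.

```python
MIB = 1024 ** 2  # bytes per mebibyte


def image_bytes(width, height, channels=3, bytes_per_value=1):
    """Size in bytes of a dense image buffer (hypothetical helper)."""
    return width * height * channels * bytes_per_value


# Assumed output: 512x512 RGB at 8 bits per channel.
uint8_mib = image_bytes(512, 512) / MIB
# The same image held as a float32 tensor is four times larger.
float32_mib = image_bytes(512, 512, bytes_per_value=4) / MIB

print(f"8-bit RGB image: {uint8_mib:.2f} MiB")
print(f"float32 tensor:  {float32_mib:.2f} MiB")
```

Either way, the final image is tiny next to a half-gigabyte jump, so most of that jump is presumably working memory for the decode step rather than the output itself.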

The diffusion model is done. Shouldn’t it be able to reclaim more than enough memory from that?

I guess the answer is that the allocator is grabbing more memory because it can: there’s no memory pressure yet. Throwing a torch.cuda.empty_cache() in there before that stage seems to confirm this. With that call, memory goes down a tad at that point instead of up, and it stays there for the next pipeline run.
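One way to check this is to compare what PyTorch has actually allocated to tensors with what its caching allocator has reserved from the driver, before and after emptying the cache. A minimal sketch, assuming a single default CUDA device; `cuda_memory_mib` and `release_cached_blocks` are hypothetical helper names, and the import guard is only there so the snippet also runs on a machine without PyTorch or a GPU.

```python
try:
    import torch
except ImportError:  # lets the sketch run where PyTorch is not installed
    torch = None

MIB = 1024 ** 2


def cuda_memory_mib():
    """Return (allocated, reserved) in MiB, or None when CUDA is unavailable."""
    if torch is None or not torch.cuda.is_available():
        return None
    return (
        torch.cuda.memory_allocated() / MIB,  # memory held by live tensors
        torch.cuda.memory_reserved() / MIB,   # memory held by the caching allocator
    )


def release_cached_blocks():
    """Empty the allocator cache and return (before, after) readings."""
    before = cuda_memory_mib()
    if before is None:
        return None
    torch.cuda.empty_cache()  # returns unused cached blocks to the driver
    return before, cuda_memory_mib()


print(release_cached_blocks())
```

If the reserved figure drops while the allocated figure stays the same, the extra memory was cache held by the allocator rather than memory in use by tensors.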

So maybe I’ve answered my own question, but I’m still confused about that big chunk of memory that is not allocated during pipeline init, but seems to be on our first forward call, and I’m not sure if or when it is reclaimed.

doem1997 · October 18, 2023, 1:44pm

Same problem here. The StableDiffusionPipeline's VRAM usage keeps growing, especially at the end of a batch inference run. A simple workaround is to add this line:

# Set this on the loaded pipeline (pipe) before calling it
pipe.final_offload_hook = True

Explanation

The pipeline offloads the other modules before the VAE decoder runs. Concretely, I found this code in the pipeline:

if hasattr(self, "final_offload_hook") and self.final_offload_hook is not None:
    self.unet.to("cpu")
    self.controlnet.to("cpu")
    torch.cuda.empty_cache()

So just setting final_offload_hook should work.
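The same idea can be applied by hand, without relying on that attribute. This is a rough sketch, not the library's own method: `offload_to_cpu` is a hypothetical helper that moves whatever modules it is given to the CPU and then releases cached CUDA blocks, and the import guard is only there so it runs without PyTorch installed.

```python
try:
    import torch
except ImportError:  # lets the sketch run where PyTorch is not installed
    torch = None


def offload_to_cpu(modules):
    """Hypothetical helper: move each module to the CPU, then free cached CUDA blocks.

    Accepts any objects with a .to(device) method; None entries are skipped.
    Returns the number of modules moved.
    """
    moved = 0
    for module in modules:
        if module is not None:
            module.to("cpu")
            moved += 1
    if torch is not None and torch.cuda.is_available():
        torch.cuda.empty_cache()  # hand the freed blocks back to the driver
    return moved
```

For a loaded pipeline this could be called as offload_to_cpu([pipe.unet, getattr(pipe, "controlnet", None)]), since not every pipeline has a controlnet. The modules have to be moved back to the GPU before the next run, which costs transfer time.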

evanpria · December 12, 2023, 9:24am

I have the same problem: my virtual memory reaches 28 GB.

Is there any way to get my memory back to what it was before?
