Hello folks!
I am trying to clear the GPU cache during multi-GPU training. I am using both torch.cuda.empty_cache() and accelerator.free_memory(), but the GPU memory still gets saturated. torch.cuda.empty_cache() worked for the same code on a single GPU when I wasn't using Accelerate (after deleting the unused variables and calling gc.collect()).
Can someone suggest how to clear the GPU memory on all GPUs when doing multi-GPU training with Accelerate?
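For context, this is roughly the cleanup pattern that brought memory down on a single GPU (a minimal sketch only; unused_tensor is just a placeholder for the variables I delete in my actual loop, not my real code):

import gc
import torch

# placeholder tensor standing in for intermediate results that are no longer needed
unused_tensor = torch.randn(1024, 1024, device="cuda")

# drop the Python reference first, then collect and release cached blocks back to the driver
del unused_tensor
gc.collect()
torch.cuda.empty_cache()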
Hello @sheldon-spock, can you provide a minimal reproducible code example for the issue?
Hi @smangrul, sure:
loss_1, loss_2, loss_3 = stack(batch_input, batch_labels)  # "stack" refers to 2 models applied in series
loss = loss_1 + loss_2 + loss_3
accelerator.backward(loss)
optimizer.step()
optimizer.zero_grad()
# drop the loss tensors, then try to release cached GPU memory
del loss_1, loss_2, loss_3
gc.collect()
torch.cuda.empty_cache()
accelerator.free_memory()
When I was doing this on a single GPU without Accelerate, the GPU memory usage went down significantly after every training step that ended with torch.cuda.empty_cache() (I checked this by printing the memory usage when calling the models). However, I am seeing almost no reduction in memory on the multiple GPUs with Accelerate.
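Something like the sketch below is what I mean by printing the memory per GPU (print_gpu_memory is just an illustrative helper, not the exact code in my script; it uses torch.cuda.memory_allocated / memory_reserved and the process index from the prepared Accelerator):

import torch

def print_gpu_memory(accelerator, tag=""):
    # report allocated and reserved CUDA memory for this process's GPU
    device = accelerator.device
    allocated = torch.cuda.memory_allocated(device) / 1024**2
    reserved = torch.cuda.memory_reserved(device) / 1024**2
    print(f"[rank {accelerator.process_index}] {tag} "
          f"allocated: {allocated:.0f} MiB, reserved: {reserved:.0f} MiB")

# called e.g. right after the cleanup at the end of each step
# print_gpu_memory(accelerator, "after empty_cache")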
Thanks!