Model is not properly moved to GPU memory with torch.no_grad()

ekazakos · August 24, 2022, 9:04am

Hi everyone,

I’m using the OWLViTForObjectDetection model and I want to perform inference on the GPU. What I’m doing is something like:

model = model.to(device='cuda')
with torch.no_grad():
    model.eval()
    data = data.to(device='cuda')
    # inference code

It seems that including torch.no_grad() causes some of the model’s parameters not to be copied to GPU memory: I’m getting an error saying that all tensors should be on the same device, but at least two different devices were found (cuda and cpu). If I remove torch.no_grad(), the error does not happen, but then I get an out-of-memory error because all the model’s activations are kept in GPU memory for gradient calculation.

This has never happened to me with the various models I’ve used in the past, so I’m wondering whether it is specific to HuggingFace models. Has this occurred to anyone else? Are there any known workarounds?

Thank you!

adirik · August 24, 2022, 9:55am

Hi @ekazakos,

You are probably getting a GPU error unrelated to torch.no_grad() if you installed the PyPI release of transformers with pip install transformers. Sorry about that! This issue was fixed a few weeks ago, and you should be able to run the model without any problems if you install the development branch instead:
pip install -q git+https://github.com/huggingface/transformers.git

In general, there is no need to call the eval() method within torch.no_grad(), as in the snippet below. If your issue persists, could you copy and paste a minimal code example that reproduces the error, along with the full error log?

model = model.to(device='cuda')
model.eval()
with torch.no_grad():
    data = data.to(device='cuda')
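
To make the ordering above concrete, here is a minimal sketch. It uses a small `torch.nn.Linear` layer as a hypothetical stand-in for OWLViTForObjectDetection and falls back to the CPU when no GPU is available; both choices are assumptions for illustration, not part of the original code. It also collects the devices the parameters live on, which can help narrow down a "tensors on different devices" error.

```python
import torch
from torch import nn

# Assumption: a tiny linear layer stands in for the real detection model.
device = "cuda" if torch.cuda.is_available() else "cpu"
model = nn.Linear(4, 2)

model.to(device)  # move parameters and buffers to the target device
model.eval()      # switch layers such as dropout to inference behaviour

# Device types the parameters live on; a single entry is expected here.
param_devices = {p.device.type for p in model.parameters()}

data = torch.randn(3, 4).to(device)  # hypothetical input batch
with torch.no_grad():                # only disables gradient tracking
    output = model(data)

print(param_devices, output.requires_grad)
```

torch.no_grad() only turns off gradient tracking and eval() only changes layer behaviour; neither of them moves tensors between devices.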

Hope this helps!

ekazakos · August 24, 2022, 9:58am

Thank you @adirik !! Will shortly try and let you know!

nielsr · August 24, 2022, 11:00am

Note that PyTorch moves a model in-place, so it’s sufficient to do:

model.to(device="cuda")
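
A small sketch of the difference, using a hypothetical `nn.Linear` layer and a dtype change so that it also runs on a machine without a GPU (the same rule applies to device moves): `Module.to` modifies the module in place and returns the same object, whereas `Tensor.to` returns a new tensor whenever a conversion is needed, so its result has to be assigned.

```python
import torch
from torch import nn

model = nn.Linear(4, 2)             # hypothetical stand-in for a real model
returned = model.to(torch.float64)  # converts the parameters in place
same_module = returned is model     # Module.to returns the module itself

data = torch.randn(3, 4)            # float32 under the default settings
converted = data.to(torch.float64)  # returns a new tensor; data is unchanged
```

This is why data = data.to(...) needs the assignment for input tensors, while a bare model.to(...) is enough for the model.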

ekazakos · August 24, 2022, 11:04am

Thanks! I’ve been using PyTorch for a few years now and I didn’t know about this.

ekazakos · August 24, 2022, 2:26pm

Hi @adirik,

It works! Thank you!
