If I have a small amount of VRAM compared to the model, will PyTorch still use CUDA acceleration?

mstachow · April 21, 2023, 1:15pm

I have a GeForce GT 1030, which has 2 GB of VRAM and ~350 CUDA cores. I’ve heard that people have had success running some models on these cards, but I’m wondering what happens if I try to run a larger model, say a 13B-parameter model, that requires more VRAM than I have. Does the CUDA part just get skipped, or does the code fail to run at all? Or does torch break the execution into pieces and run them on the card one at a time? If it’s the latter, would I still see a speedup from using CUDA compared to the CPU?
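To put rough numbers on the gap, here is a back-of-the-envelope sketch of how much memory the weights alone would need at common precisions. The helper name `weight_memory_gib` and the precision table are my own for illustration, and the estimate ignores activations, the KV cache, and framework overhead, so real usage would be higher.

```python
# Bytes per parameter for common weight precisions (standard storage sizes).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params: float, dtype: str) -> float:
    """Memory for the weights alone, in GiB (hypothetical helper, not a library call)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


if __name__ == "__main__":
    params = 13e9    # the "13B" model from the question
    vram_gib = 2.0   # VRAM on the GT 1030
    for dtype in BYTES_PER_PARAM:
        need = weight_memory_gib(params, dtype)
        fits = need <= vram_gib
        print(f"{dtype}: {need:.1f} GiB for weights, fits in {vram_gib} GiB: {fits}")
```

Even at 4 bits per parameter the weights come to about 6 GiB, so under these assumptions a 13B model could not sit entirely on a 2 GB card.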
