So I’m about to try to replicate a QLoRA notebook, hopefully fine-tuning a model on a task in 4-bit with PEFT. The idea is to train a very large LM (billions of parameters) on a single GPU. As the explanation wraps up and the narrator walks through the required libraries, he says ‘accelerate’ will be used.
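For context, the model-loading step in these notebooks typically looks something like the sketch below. The model name and LoRA hyperparameters are my own placeholders from the usual QLoRA recipe, not the exact values from the notebook:

```python
# Rough sketch of a typical QLoRA loading step -- model_id and the
# LoRA settings are placeholders, not the notebook's exact values.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "meta-llama/Llama-2-7b-hf"  # placeholder; any causal LM works

# 4-bit NF4 quantization via bitsandbytes (the "Q" in QLoRA)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # transformers requires accelerate for device_map
)

model = prepare_model_for_kbit_training(model)

# LoRA adapters (the "LoRA" in QLoRA) -- only these small matrices are trained
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # placeholder module names
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

I notice `device_map="auto"` in there, which I gather is handled by accelerate under the hood, which brings me to my question: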
ACCELERATE - I thought this was for training across multiple GPUs? But if the intention is to train on a single local GPU, why do we need accelerate? Does anyone have a quick 5 minutes to set my mind straight and point me in the right direction?
Thanks! And no problem if not; it would just be nice to hear the answer from someone other than ChatGPT…