Stable Diffusion FP16 on multi-GPU setups?


With most HuggingFace models you can spread a model across multiple GPUs to pool their VRAM by using HF Accelerate and passing the model kwarg device_map="auto".

However, when you do that with the Stable Diffusion model, you get errors about ops being unimplemented on CPU for half(). Is there a way around this without switching to FP32 (e.g., a device_map that covers everything except the CPU, or dynamically swapping model parts from RAM to VRAM as needed)?
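For context, one direction I've been looking at is constraining the placement so no weights ever land on CPU. Accelerate-style loaders accept a max_memory dict alongside device_map, and setting the CPU budget to zero should force all shards onto GPUs. This is a sketch, not a verified fix: gpu_only_max_memory is a hypothetical helper I wrote for illustration, and the diffusers kwargs / model id in the comments are assumptions about the API, not something I've confirmed works here.

```python
# Sketch: build an accelerate-style max_memory map that allots VRAM to each
# GPU and nothing to the CPU, so no fp16 weights get placed on CPU.
# The GiB numbers are illustrative, not measured.

def gpu_only_max_memory(num_gpus, per_gpu_gib):
    """Return a max_memory dict like {0: "10GiB", 1: "10GiB", "cpu": "0GiB"}."""
    memory = {i: f"{per_gpu_gib}GiB" for i in range(num_gpus)}
    memory["cpu"] = "0GiB"  # forbid CPU placement entirely
    return memory

# Hypothetical usage with diffusers (untested, requires torch + diffusers;
# device_map="balanced" is the pipeline-level sharding mode in recent diffusers):
#
# import torch
# from diffusers import StableDiffusionPipeline
# pipe = StableDiffusionPipeline.from_pretrained(
#     "runwayml/stable-diffusion-v1-5",
#     torch_dtype=torch.float16,
#     device_map="balanced",
#     max_memory=gpu_only_max_memory(2, 10),  # keep all components on GPUs
# )
#
# For the "swap from RAM to VRAM as needed" idea, diffusers pipelines also
# expose pipe.enable_sequential_cpu_offload(), which keeps weights in RAM
# and moves each module to the GPU only for its forward pass.
```

If something like this works, the zero-CPU budget would sidestep the unimplemented-on-CPU half() ops, since placement (not compute) is what puts fp16 tensors on the CPU in the first place.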