Is there a good/easy way to know what blocks should in `no_split_module_classes` when using multi GPU setup?

ahans1 · July 14, 2023, 8:32pm

Each time I use device_map='auto' via accelerate I get the RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1! exception.

This doesn’t occur only when I specify the appropriate no_split_module_classes in load_checkpoint_and_dispatch method for the model I’m working on. Is there an easy way to determine which blocks should be not split across GPUs for given model/checkpoint?

Topic		Replies	Views
RuntimeError: Expected all tensors to be on the same device, but found at least two devices Beginners	0	95	November 30, 2024
Unable to train Bert by splitting across GPUs 🤗Transformers	0	456	June 24, 2022
Multi-GPU finetuning of NLLB produces RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0 🤗Transformers	2	1123	June 9, 2025
Trainer.evalute() with multi GPUs results Expected all tensors to be on the same device, but found at least two devices, cuda:3 and cuda:0! Beginners	2	83	February 11, 2025
Finetuning T5-large on Multiple GPUs 🤗Transformers	0	1080	April 26, 2023

Is there a good/easy way to know what blocks should in `no_split_module_classes` when using multi GPU setup?

Related topics