How to Use torch_directml GPU with Transformers.Trainer for Fine-Tuning?

Hello,

I’m trying to use a torch_directml device (GPU) for fine-tuning with the `Trainer` class from the Hugging Face Transformers library. However, I’ve noticed that the Trainer automatically falls back to the CPU if neither a CUDA nor an MPS device is available.

Even when I explicitly move the model to the DML device, it gets reverted to the CPU during training.
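Here is roughly what I’m doing (a minimal sketch; the model name is just an example, and the snippet assumes the `torch-directml` package is installed):

```python
import torch_directml
from transformers import AutoModelForSequenceClassification, Trainer, TrainingArguments

# Get the DirectML device (shows up as a "privateuseone" device in PyTorch)
dml = torch_directml.device()

model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased")
model.to(dml)  # the model is on the DML device at this point...

args = TrainingArguments(output_dir="out")
trainer = Trainer(model=model, args=args)
# ...but when training starts, the Trainer places the model on
# args.device, which resolves to CPU since no CUDA/MPS device is found
trainer.train()
```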

Is there a way to configure the Trainer to use a torch_directml device for fine-tuning? Any help or guidance would be much appreciated.

Thanks in advance!
