Yes.
My code is here:
import torch
from transformers import Trainer, TrainingArguments

torch.cuda.set_device(1)
torch.cuda.current_device()  # returns 1
epochs = 3
training_args = TrainingArguments(
    output_dir='./results',
    overwrite_output_dir=True,
    do_train=True,
    do_predict=True,
    num_train_epochs=epochs,
    per_device_train_batch_size=190,
    learning_rate=5e-05,
    warmup_steps=500,
    logging_dir='./logs',
    logging_steps=10,
    save_steps=50,
    save_total_limit=100)
training_args.device

The output is:

device(type='cuda', index=0)
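For reference, a minimal way to isolate this in a fresh session (a sketch, assuming nothing else has touched CUDA yet; the last print is there to check whether merely accessing args.device resets the current device):

import torch
from transformers import TrainingArguments

torch.cuda.set_device(1)
print(torch.cuda.current_device())  # 1, as expected

args = TrainingArguments(output_dir='./results')
print(args.device)                  # reports index 0 in my case
print(torch.cuda.current_device())  # check whether accessing args.device reset this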
Then I create the Trainer:
trainer = Trainer(
    model=model,
    args=training_args,
    data_collator=data_collator,
    train_dataset=train_dataset)
!nvidia-smi

I can see two GPUs, but nvidia-smi shows that the process is still running on GPU #0. It seems that torch.cuda.set_device(1) doesn't work at all.
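In case it helps narrow things down, the only reliable way I know to pin the process to GPU 1 is to hide GPU 0 entirely with CUDA_VISIBLE_DEVICES. A sketch, assuming a fresh process where CUDA has not been initialized yet:

import os

# Must be set before torch initializes CUDA; physical GPU 1 is then the
# only visible device and is exposed to the process as cuda:0.
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

import torch
from transformers import TrainingArguments

print(torch.cuda.device_count())    # expected: 1
print(torch.cuda.current_device())  # expected: 0 (i.e. physical GPU 1)

training_args = TrainingArguments(output_dir='./results')
print(training_args.device)         # should now resolve to the remaining GPU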