Hey,
Recently I tried to fine-tune the Llama 8B model using the Trainer. I noticed that when the Trainer starts, it automatically splits the model's weights and distributes them across different GPUs. I have eight GPUs, but I'd like to use only 2 or 3 of them. Other than setting the CUDA_VISIBLE_DEVICES environment variable, is there any other solution? I went through the TrainingArguments options but couldn't find anything relevant.
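
For reference, here's a minimal sketch of roughly what my setup looks like (the model id and the toy dataset are just placeholders; I suspect the `device_map="auto"` load is what spreads the weights over all eight GPUs):

```python
import torch
from datasets import Dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_id = "meta-llama/Meta-Llama-3-8B"  # placeholder for the 8B checkpoint I'm using

tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",  # I suspect this is what shards the weights across every visible GPU
)

# toy dataset just to make the example self-contained
ds = Dataset.from_dict({"text": ["hello world"] * 8})
ds = ds.map(
    lambda x: tokenizer(x["text"], truncation=True, max_length=32),
    remove_columns=["text"],
)

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=1,
    num_train_epochs=1,
    # nothing in here seems to restrict which GPUs get used
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

Is there a way, from Python or from the Trainer/TrainingArguments side, to limit this to a subset of the GPUs?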