Finetuning GPT2 with user defined loss

aclifton314 · May 22, 2023, 10:21pm

@SUNM It is my understanding that if you train a model with the huggingface Trainer class (as you’ve shown in the post Finetuning GPT2 using Multiple GPU and Trainer) and you have multiple GPUs available on your system, then Trainer will use all of the available GPUs unless explicitly told not to do so.

So, if you have 4 GPUs available but you set the environment variable CUDA_VISIBLE_DEVICES=“1,2,3”, then Trainer will only utilize those 3 GPU instead of all 4 of them.

Topic		Replies	Views
Loading finetuned model to generate text 🤗Transformers	12	3311	August 7, 2023
GPT-2 fine-tuning Beginners	0	1609	June 12, 2023
Generate method during finetuning Beginners	6	1941	July 30, 2020
Using GPT-J for custom sequence classification Beginners	0	407	September 14, 2022
Need help with gpt2 model Beginners	0	585	July 9, 2023

Finetuning GPT2 with user defined loss

Related topics