【Solved】How can I get loss by using trainer when training gpt2?

Azily · July 20, 2022, 2:51pm

I want to use Trainer to train my GPT-2 model, but when I pass the dataset to the trainer, it raises KeyError: 'loss'. I looked for the cause and found that the compute_loss function does not work the way I expected. So I wrote my own compute_loss function, but the inputs it receives don't have the 'labels' column, which I created when tokenizing. I think it may be removed by the trainer.

How should I use Trainer correctly so that compute_loss works?

This is my compute_loss function, but it doesn't work because it can't find a column named labels.
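
A likely explanation for the missing column, sketched under assumptions: with its default remove_unused_columns=True setting, Trainer keeps only dataset columns whose names match arguments of the model's forward method, and GPT2Model.forward has no labels argument. The helper accepts_labels below is a hypothetical name used only for this illustration, not part of the library.

```python
import inspect

from transformers import GPT2LMHeadModel, GPT2Model


def accepts_labels(model_cls):
    """Return True if the model's forward() takes a `labels` argument.

    Hypothetical helper for illustration only; it mirrors the kind of
    signature inspection Trainer uses to decide which columns to keep.
    """
    return "labels" in inspect.signature(model_cls.forward).parameters


base_accepts = accepts_labels(GPT2Model)        # bare transformer, no LM head
lm_accepts = accepts_labels(GPT2LMHeadModel)    # transformer + language-modeling head

print("GPT2Model.forward takes labels:", base_accepts)
print("GPT2LMHeadModel.forward takes labels:", lm_accepts)
```

If this is what happens here, the labels column is dropped before the batch reaches a custom compute_loss. Passing remove_unused_columns=False in TrainingArguments keeps every column, but any extra columns then have to be handled by the collator and the custom loss code.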

Azily · July 20, 2022, 2:53pm

Here are more pictures of the problem.
This is my dataset. It is only used to test how to use the trainer, so I don't distinguish between them.

Azily · July 20, 2022, 2:53pm

This is my trainer.

Azily · July 21, 2022, 8:52am

After switching from GPT2Model to GPT2LMHeadModel, the problem no longer appears. So I wonder what the difference between them is.
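
On the difference between the two classes, here is a minimal sketch. GPT2Model is the bare transformer and returns hidden states only, while GPT2LMHeadModel adds a language-modeling head and returns a loss when labels are passed. The tiny config values below are assumptions chosen so the example runs quickly with randomly initialized weights; they are not the settings from this thread.

```python
import torch
from transformers import GPT2Config, GPT2LMHeadModel, GPT2Model

# Tiny, randomly initialized config (assumed values, for illustration only),
# so nothing is downloaded and the forward pass is fast.
config = GPT2Config(vocab_size=100, n_positions=32, n_embd=16, n_layer=1, n_head=2)

torch.manual_seed(0)
input_ids = torch.randint(0, config.vocab_size, (2, 8))  # batch of 2, length 8

base = GPT2Model(config)       # transformer body only
lm = GPT2LMHeadModel(config)   # body + language-modeling head

with torch.no_grad():
    base_out = base(input_ids=input_ids)
    # For causal LM training the labels are usually the input ids themselves;
    # the model shifts them internally before computing cross-entropy.
    lm_out = lm(input_ids=input_ids, labels=input_ids)

base_has_loss = "loss" in base_out.keys()
lm_has_loss = "loss" in lm_out.keys()

print("GPT2Model output keys:", list(base_out.keys()))
print("GPT2LMHeadModel output keys:", list(lm_out.keys()))
```

Since the default compute_loss in Trainer looks up the loss in the model output, a model whose output has no loss entry would plausibly produce the KeyError: 'loss' described above, which fits with the error disappearing after the switch to GPT2LMHeadModel.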
