Is there a way to get per word loss instead of the average loss for GPT model

beston91 · March 7, 2022, 10:09pm

Reposting as it seems to suit this subtopic better.

Hi,

I would like to finetune a gpt2 model using a custom loss function, that will return zero loss for all but the last token in a sentence. However, the loss from the output of the model seems to be the averaged loss. Is it possible to get a per word loss.

Thanks in advance for any help.

Topic		Replies	Views
Can we get per word loss from the output of a GPT model Beginners	0	366	March 2, 2022
How to calculate GPT-2 sentence loss (for each sentence) if batch has 2 or more sentences? Beginners	1	833	April 17, 2023
GPT-2 custom loss Models	0	487	July 18, 2022
How to compute per-token loss when doing language modeling? 🤗Transformers	3	3230	August 23, 2023
Finetuning GPT2 with user defined loss Beginners	56	16089	July 23, 2023

Is there a way to get per word loss instead of the average loss for GPT model

Related topics