Can we get per word loss from the output of a GPT model

beston91 · March 2, 2022, 3:49pm

Hi,

I would like to finetune a gpt2 model using a custom loss function, that will return zero loss for all but the last token in a sentence. However, the loss from the output of the model seems to be the averaged loss. Is it possible to get a per word loss.

Thanks in advance for any help.

Topic		Replies	Views
Is there a way to get per word loss instead of the average loss for GPT model 🤗Transformers	0	328	March 7, 2022
How to calculate GPT-2 sentence loss (for each sentence) if batch has 2 or more sentences? Beginners	1	833	April 17, 2023
GPT-2 custom loss Models	0	487	July 18, 2022
How to compute per-token loss when doing language modeling? 🤗Transformers	3	3241	August 23, 2023
Loss in a Seq2Seq task 🤗Transformers	0	156	June 5, 2024

Can we get per word loss from the output of a GPT model

Related topics