Loss from calling model and computing explicitly don't match

acharuva · June 30, 2023, 2:15am

Hi All,

I'm calling the model as follows:

output = model(input, labels=target_ids)
loss1 = output.loss

loss2 = F.cross_entropy(output.logits, target_ids)

I expected the two losses to be the same, but they aren't. What am I missing?
The model is a GPT2LMHeadModel.
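
For anyone comparing the two numbers: one likely cause is that GPT2LMHeadModel shifts the labels internally when `labels` is passed, so the logits at position t are scored against the token at position t+1, and the last logit and the first label are dropped. `F.cross_entropy` also expects the class dimension second, so the (batch, seq, vocab) logits need flattening first. Below is a minimal sketch of a manual computation that should reproduce `output.loss`. It assumes the labels are just a copy of the input ids, and it uses a tiny randomly initialised config (the sizes are made up for illustration) so that nothing has to be downloaded.

```python
import torch
import torch.nn.functional as F
from transformers import GPT2Config, GPT2LMHeadModel

torch.manual_seed(0)

# Tiny random model; these sizes are hypothetical, chosen only to keep the example fast.
config = GPT2Config(vocab_size=100, n_positions=32, n_embd=32, n_layer=2, n_head=2)
model = GPT2LMHeadModel(config).eval()

# Assumption: labels are a copy of the inputs (standard causal LM setup).
input_ids = torch.randint(0, config.vocab_size, (2, 10))
target_ids = input_ids.clone()

with torch.no_grad():
    output = model(input_ids, labels=target_ids)
loss1 = output.loss

# Manual loss: drop the last logit and the first label so that
# position t predicts token t+1, then flatten to (N, vocab) and (N,).
shift_logits = output.logits[:, :-1, :].contiguous()
shift_labels = target_ids[:, 1:].contiguous()
loss2 = F.cross_entropy(
    shift_logits.view(-1, shift_logits.size(-1)),
    shift_labels.view(-1),
)

print(loss1.item(), loss2.item())
print(torch.allclose(loss1, loss2, atol=1e-5))
```

If the two values still differ after shifting, it may be worth checking whether the labels contain padding positions; the model's loss ignores labels set to -100, which is also the default `ignore_index` of `F.cross_entropy`.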
