Do I need to apply the softmax function to my logit before calculating the CrossEntropyLoss?

h56cho · October 15, 2020, 5:22pm

Hello, I am trying to compute the CrossEntropyLoss directly by using this code:

loss_fct = CrossEntropyLoss()
mc_loss = loss_fct(reshaped_logits, mc_labels)

If the reshaped_logits contain the logit values before softmax, should I apply nn.softmax function before I do loss_fct(reshaped_logits, mc_labels)? Thank you,

sgugger · October 15, 2020, 6:24pm

PyTorch CrossEntropyLoss combines softmax and the cross-entropy loss, so no.

Topic		Replies	Views
What is the loss function of a pre-trained T5 model? Models	1	1198	June 19, 2023
Help with custom loss function Beginners	0	2051	February 21, 2022
Gradient accumulation loss compute Beginners	0	75	June 4, 2024
Negative "cross entropy" loss function 🤗Transformers	0	1538	December 15, 2022
Custom Training Loss Function for Seq2Seq BART Beginners	1	1724	July 21, 2023

Do I need to apply the softmax function to my logit before calculating the CrossEntropyLoss?

Related topics