I’d like to train BERT from scratch on my custom corpus for the masked language modeling (MLM) task. But the corpus has one peculiarity: it is a sequence of numbers, and the absolute value of the difference between two words corresponds to their proximity. Therefore I guess I should use this difference (or something similar) as the loss function during training. Is it possible to use a custom loss function when training a BERT model for the MLM task?
You can compute the loss outside of your model since it returns the logits, and apply any function you like.
If your question was related to the Trainer, you should define your own subclass with a compute_loss method. There is an example in the documentation (scroll a bit down).
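For reference, a minimal sketch of such a subclass, following the pattern shown in the documentation. The cross-entropy criterion here is only a placeholder; a custom loss (e.g. one derived from the absolute difference between token values) would go in its place. For an MLM head, the last logits dimension is the vocabulary size rather than `num_labels`:

```python
import torch
from torch import nn
from transformers import Trainer


class CustomLossTrainer(Trainer):
    """Trainer subclass that overrides how the training loss is computed."""

    def compute_loss(self, model, inputs, return_outputs=False):
        # Pop the labels so the model does not compute its own loss.
        labels = inputs.pop("labels")
        outputs = model(**inputs)
        logits = outputs.logits
        # Placeholder criterion: any differentiable loss can go here.
        loss_fct = nn.CrossEntropyLoss()
        loss = loss_fct(
            # For MLM, replace num_labels with model.config.vocab_size.
            logits.view(-1, self.model.config.num_labels),  # (batch*seq_len, C)
            labels.view(-1),                                # (batch*seq_len,)
        )
        return (loss, outputs) if return_outputs else loss
```

You would then construct `CustomLossTrainer` with the same arguments as a regular `Trainer` and call `.train()` as usual.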
In the link you attached above, I have a question related to the example. Why do we need this line of code when computing the loss?
loss = loss_fct(logits.view(-1, self.model.config.num_labels), labels.view(-1))
I mean the .view() method. Why do we have to reshape the logits tensors?
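For context, here is a sketch of the shapes involved (assuming a token-level task where the model emits logits of shape `(batch, seq_len, num_labels)`). `nn.CrossEntropyLoss` expects a 2-D input of shape `(N, C)` and a 1-D target of shape `(N,)`, so both tensors are flattened across the batch and sequence dimensions first:

```python
import torch
from torch import nn

batch_size, seq_len, num_labels = 2, 8, 5

# Hypothetical model outputs and targets for a token-level task.
logits = torch.randn(batch_size, seq_len, num_labels)         # (2, 8, 5)
labels = torch.randint(0, num_labels, (batch_size, seq_len))  # (2, 8)

# nn.CrossEntropyLoss expects input (N, C) and target (N,), so the
# batch and sequence dimensions are merged into a single axis.
flat_logits = logits.view(-1, num_labels)  # (16, 5)
flat_labels = labels.view(-1)              # (16,)

loss = nn.CrossEntropyLoss()(flat_logits, flat_labels)
print(flat_logits.shape, flat_labels.shape, loss.shape)
```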