SFTTrainer Loss function

nielsr · July 8, 2024, 8:12am

The loss function being used is the cross-entropy loss. It is defined within the model, e.g. here for llama. In case you want to use your own custom loss function, you can overwrite the compute_loss method of the Trainer as explained here.

For training an LLM the loss function is computed on the whole concatenated prompts, how to alter this and make loss function only compute on the output prompts

Indeed as recommended above, the DataCollatorForCompletionOnlyLM can be used for this purpose.

Topic		Replies	Views
How can I know what loss function I am using? Beginners	1	86	May 25, 2025
Fine-tuning queries Beginners	0	39	February 20, 2025
Create a weighted loss function to handle imbalance? 🤗Transformers	3	1355	May 21, 2025
Supervised Fine-tuning Trainer - Loss function calculation Beginners	0	3334	September 6, 2023
Custom loss trainer takes hours and validation loss starts so differently then test loss (Learning Without Forgetting) Intermediate	0	18	April 10, 2025

SFTTrainer Loss function

Related topics