SFTTrainer Loss function

ZeyadMahmoud · April 7, 2024, 11:51am

I have a couple of questions:

how to know the loss function used by default for SFTTrainer for a given model and how to alter it?
For training an LLM the loss function is computed on the whole concatenated prompts, how to alter this and make loss function only compute on the output prompts

abhijeet-ta · July 8, 2024, 8:07am

Hi,
I would recommend exploring DataCollatorForCompletionOnlyLM in HuggingFace for training LLM on outputs only!

nielsr · July 8, 2024, 8:12am

The loss function being used is the cross-entropy loss. It is defined within the model, e.g. here for llama. In case you want to use your own custom loss function, you can overwrite the compute_loss method of the Trainer as explained here.

For training an LLM the loss function is computed on the whole concatenated prompts, how to alter this and make loss function only compute on the output prompts

Indeed as recommended above, the DataCollatorForCompletionOnlyLM can be used for this purpose.

Topic		Replies	Views
How can I know what loss function I am using? Beginners	1	83	May 25, 2025
Fine-tuning queries Beginners	0	38	February 20, 2025
Create a weighted loss function to handle imbalance? 🤗Transformers	3	1268	May 21, 2025
Supervised Fine-tuning Trainer - Loss function calculation Beginners	0	3329	September 6, 2023
Custom loss trainer takes hours and validation loss starts so differently then test loss (Learning Without Forgetting) Intermediate	0	18	April 10, 2025

SFTTrainer Loss function

Related topics