How to train an LLM only on responses

sd3ntato · July 19, 2023, 7:35am

Hello, how do I train the model only on responses rather than prompt and response?

Is it just a matter of attention masks?

nielsr · July 24, 2023, 2:45pm

Hi,

The easiest way is to use the SFTTrainer of trl, combined with the DataCollatorForCompletionOnlyLM. The latter lets you train only on the responses, and not on the prompts.

It’s brand new, we’re adding docs for it here: Add `DataCollatorForCompletionOnlyLM` in the docs by younesbelkada · Pull Request #565 · lvwerra/trl · GitHub
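
On the attention-mask question: as far as I understand it, the attention mask only controls which tokens the model attends to, while the loss is controlled through the labels. A completion-only collator works by setting the label of every prompt token to -100, which is the default ignore index of PyTorch's cross-entropy loss. Here is a minimal plain-Python sketch of that idea. It is not trl's actual implementation; the helper names and the token ids are made up for illustration.

```python
# Label value that PyTorch's CrossEntropyLoss skips by default (ignore_index=-100).
IGNORE_INDEX = -100


def find_subsequence(sequence, pattern):
    """Return the start index of the first occurrence of pattern, or -1."""
    n, m = len(sequence), len(pattern)
    for start in range(n - m + 1):
        if sequence[start:start + m] == pattern:
            return start
    return -1


def mask_prompt_labels(input_ids, response_template_ids):
    """Copy input_ids into labels and ignore everything up to the response.

    Hypothetical helper: the prompt and the response template itself get
    IGNORE_INDEX, so only the response tokens contribute to the loss.
    """
    labels = list(input_ids)
    start = find_subsequence(input_ids, response_template_ids)
    if start == -1:
        # Template not found: ignore the whole example rather than
        # accidentally training on the prompt.
        return [IGNORE_INDEX] * len(labels)
    response_start = start + len(response_template_ids)
    for i in range(response_start):
        labels[i] = IGNORE_INDEX
    return labels


# Made-up token ids: 11-13 are the prompt, [90, 91] stands in for a
# tokenized response template such as "### Answer:", 21, 22, 2 are the response.
input_ids = [11, 12, 13, 90, 91, 21, 22, 2]
response_template_ids = [90, 91]

labels = mask_prompt_labels(input_ids, response_template_ids)
print(labels)  # [-100, -100, -100, -100, -100, 21, 22, 2]
```

The attention mask stays all ones here, so the model still reads the full prompt as context; it just receives no gradient from predicting the prompt tokens.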

sd3ntato · July 24, 2023, 2:46pm

very interesting, thanks!
