How is the prompt + answer handled during training

AdL8 · March 20, 2024, 5:28pm

Hello,

I am really confused about how the model trains on the prompt and the answer to that prompt. Does the model use the whole prompt as context and only try to predict each token of the answer, sliding the context window along? Or does training also start by trying to predict each token in the prompt?
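To make the two options concrete, here is a rough sketch of what I imagine each one looks like. The token ids and both helper names (`build_example`, `supervised_targets`) are made up for illustration and are not from any library. The only thing I am relying on is that -100 is the default `ignore_index` of PyTorch's cross-entropy loss, so labels set to -100 would not contribute to the loss.

```python
IGNORE_INDEX = -100  # default ignore_index of PyTorch's cross-entropy loss


def build_example(prompt_ids, answer_ids, mask_prompt):
    """Concatenate prompt and answer; optionally hide prompt tokens from the loss.

    Hypothetical helper for illustration only.
    """
    input_ids = list(prompt_ids) + list(answer_ids)
    if mask_prompt:
        # Option 1: prompt is context only, loss is computed on answer tokens.
        labels = [IGNORE_INDEX] * len(prompt_ids) + list(answer_ids)
    else:
        # Option 2: every token, prompt included, is a prediction target.
        labels = list(input_ids)
    return input_ids, labels


def supervised_targets(input_ids, labels):
    """List the (context, next_token) pairs that actually receive a loss.

    In a causal LM, the tokens up to position i are used to predict token i + 1.
    """
    pairs = []
    for i in range(len(input_ids) - 1):
        target = labels[i + 1]
        if target != IGNORE_INDEX:
            pairs.append((input_ids[: i + 1], target))
    return pairs


prompt = [11, 12, 13]  # made-up token ids for the prompt
answer = [21, 22]      # made-up token ids for the answer

ids_masked, labels_masked = build_example(prompt, answer, mask_prompt=True)
ids_full, labels_full = build_example(prompt, answer, mask_prompt=False)

print(supervised_targets(ids_masked, labels_masked))
print(supervised_targets(ids_full, labels_full))
```

With the prompt masked, only the two answer tokens are targets, each predicted from the full prompt plus any earlier answer tokens. Without masking, the prompt tokens after the first one are targets as well. Which of these actually happens during fine-tuning is what I am trying to find out.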
