Sequence to sequence model

fahad0521 · November 22, 2024, 6:18am

I have a query that since we use the cross entropy loss as main metric in both train and val to detemine the model performance in terms of overfitting underfitting and generalization right? then as we use the teacher forcing in the train then we compute the loss do we also use the teacher forcing not the autoregressive technique in the validation to get same logists equal to the ground truth tokens to compute the loss
just the difference is that we donot update the parmaters in this validation phase
and autoregressive is ONLY used in the test phase?
kindly some one can help me and also provide some referece if you may know

Topic		Replies	Views
How to calculate the loss in validation Beginners	0	46	November 21, 2024
How is the loss comuted for sequence to sequence models? Beginners	9	5692	November 22, 2024
What to use for the target input in the decoder for autoregressive usage 🤗Transformers	5	4124	September 16, 2021
Tranformers Trainer API Intermediate	0	63	November 25, 2024
Seq2SeqTrainingAguments Beginners	0	260	January 26, 2023

Sequence to sequence model

Related topics