I have recently been studying GPT-2. Can someone tell me whether decoder-only models use teacher forcing the way encoder-decoder models do?
I have looked at the Hugging Face GPT-2 implementation, and it seems the labels are only used to calculate the loss. How does the model then do teacher forcing during training?
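For context, this is roughly the usage I am looking at (a minimal sketch with `GPT2LMHeadModel`; the note about shifting is based on my reading of the library source):

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

text = "The quick brown fox"
input_ids = tokenizer(text, return_tensors="pt").input_ids

# The labels are just the input ids again; as far as I can tell the model
# shifts them internally (predict token t+1 from tokens up to t) and uses
# them only to compute the cross-entropy loss.
outputs = model(input_ids, labels=input_ids)
print(outputs.loss)          # scalar language-modeling loss
print(outputs.logits.shape)  # [batch, seq_len, vocab_size]
```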