GPT2 Conditional Text Generation

manzar · September 28, 2020, 3:13pm

Hello,
I would like to fine-tune the GPT2 model on EmpatheticDialogues for conditional generation, as in this paper: https://arxiv.org/pdf/1911.11161.pdf
What concerns me is the format of the input_ids and labels passed to the forward function.
I think a good solution is to concatenate the input with the target, separating them with a special token
(e.g. "hi! how are you? I am fine!")
However, I am not sure what to do with the labels. Should I mask both the input part and the padded tokens with the -100 index and leave only the target part as is? Or should I mask only the padded tokens with -100?
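
To make the two options concrete, here is a minimal sketch in plain Python of how the input_ids and labels could be built. The helper name build_example, the token ids, sep_id, pad_id and max_len are all hypothetical and only for illustration; in practice the ids would come from the tokenizer. The only fixed value is -100, which is the default ignore_index of PyTorch's CrossEntropyLoss.

```python
IGNORE_INDEX = -100  # default ignore_index of torch.nn.CrossEntropyLoss


def build_example(context_ids, target_ids, sep_id, pad_id, max_len, mask_context=True):
    """Concatenate context + separator + target, pad to max_len, and build labels.

    All argument names are illustrative assumptions, not part of any library API.
    """
    input_ids = context_ids + [sep_id] + target_ids
    if len(input_ids) > max_len:
        raise ValueError("sequence is longer than max_len")

    n_prefix = len(context_ids) + 1  # context tokens plus the separator
    if mask_context:
        # Option 1: loss is computed on the target tokens only.
        labels = [IGNORE_INDEX] * n_prefix + target_ids
    else:
        # Option 2: loss is computed on context and target alike.
        labels = list(input_ids)

    n_pad = max_len - len(input_ids)
    attention_mask = [1] * len(input_ids) + [0] * n_pad
    input_ids = input_ids + [pad_id] * n_pad
    labels = labels + [IGNORE_INDEX] * n_pad  # padding never contributes to the loss
    return {"input_ids": input_ids, "attention_mask": attention_mask, "labels": labels}


# Hypothetical token ids, chosen only to make the layout visible.
example = build_example([11, 12, 13], [21, 22], sep_id=99, pad_id=0, max_len=8)
print(example["input_ids"])
print(example["labels"])
```

As far as I understand, GPT2LMHeadModel shifts the labels internally, so labels stay aligned position by position with input_ids and no manual shift is needed here.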

Thank you in advance
