Hi there,
I am fine-tuning a GPT-Neo model on a seq2seq task and am a bit confused about the details of how to fine-tune. Specifically, I am unsure how to construct and tokenize the training examples: do I pass input and target to the tokenizer together via `tokenizer(text=...)`, or should I use `text` for the input and `text_target` for the target? In [1] it looks like you can just concatenate both and pass the result to `text`, but then how can the model differentiate between input and target?
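For context, here is roughly what I understand the concatenation approach in [1] to look like (a minimal sketch, not the actual code from [1]; the example strings and the `EleutherAI/gpt-neo-125m` checkpoint are my own choices):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical example pair; these strings are mine, not from [1].
prompt = "question: What is the capital of France? answer:"
target = " Paris"

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neo-125m")
model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-neo-125m")

# Concatenation approach as I read it in [1]: prompt and target become
# one sequence, and labels are just a copy of input_ids, so the loss
# is computed over the prompt tokens as well as the target tokens.
enc = tokenizer(prompt + target, return_tensors="pt")
out = model(**enc, labels=enc["input_ids"])
print(out.loss)
```

What I would have expected instead is to mask the prompt positions in `labels` with `-100` so that only the target tokens contribute to the loss. Is the plain concatenation in [1] actually the right way to do this for a decoder-only model like GPT-Neo?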
Thanks
[1] gpt_neo.py, dredwardhyde/gpt-neo-fine-tuning-example on GitHub: https://github.com/dredwardhyde/gpt-neo-fine-tuning-example/blob/main/gpt_neo.py