How to format NLI input for GPT-2 finetuning

preseshadri · June 7, 2022, 1:28am

I am performing a comparison of MNLI fine-tuning performance across different models, and am having trouble with fine-tuning on GPT-2. Since NLI takes two texts as input, the hypothesis and premise, I am unsure how to format the text. Is it sufficient to just feed in "Premise: Hypothesis: " as input, or do I need to use any tokens to separate the two? If you have suggestions on how to tokenize/process the input, that would be much appreciated. Thanks!

Topic		Replies	Views
GPT-2 Data Preparation for Parsing Trees Intermediate	0	123	May 6, 2024
How to separate sequences during finetuning gpt Beginners	0	292	December 19, 2020
GPT-Neo text vs text_target for Seq2Seq Task Models	0	445	October 24, 2022
What is the correct format of input when fine-tuning GPT2 for text generation with batch input? Models	0	506	January 22, 2024
GPT-2 full python tokenizer example for Q/A finetuning Beginners	1	862	December 27, 2022

How to format NLI input for GPT-2 finetuning

Related topics