Increasing Perplexity when fine-tuning GPT-2

Hello,

I have a question regarding evaluation when fine-tuning GPT-2. My task is fine-tuning GPT-2 to write short stories. The input file consists of a small set of stories, each with the following structure:

`t Title kw Outline b Body`

For example: `t Harry Potter kw Harry goes to Hogwarts b Story`

My goal is to give GPT-2 the title and outline as prompt and have it generate the body.

I have added all three special tokens (t, kw, b) to the model, but the more I train, the higher the perplexity gets (ironically, the generations improve from a human point of view).

Did I miss something? Should I change the evaluation (so that it gets the title+outline as input and only evaluates the generated body)? If so, where exactly would I need to look to change it?
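In case it helps frame the question: one way I imagine "only evaluating the body" would work is to keep the full sequence as input but set the label positions of the prompt (everything up to and including the `b` marker) to `-100`, which the cross-entropy loss in `transformers` ignores. A minimal sketch, assuming a helper name `mask_prompt_labels` and a known token id for the `b` marker (both hypothetical):

```python
import torch

def mask_prompt_labels(input_ids: torch.Tensor, body_token_id: int) -> torch.Tensor:
    # Copy input_ids as labels, then set every position up to and
    # including the body marker to -100, so perplexity is computed
    # only over the story body, not the title+outline prompt.
    labels = input_ids.clone()
    for i, row in enumerate(input_ids):
        b_pos = (row == body_token_id).nonzero()
        if len(b_pos) > 0:
            labels[i, : b_pos[0].item() + 1] = -100
    return labels
```

The masked `labels` tensor would then be passed alongside `input_ids` when computing the eval loss, so the reported perplexity reflects only the generated-body tokens.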

Thanks for the help!
