TRL loss blowing up

RajSang · March 15, 2023, 1:37am

Hello @lvwerra , @natolambert , I am trying to use a Pegasus model and improve it in certain aspects using the TRL library. My reward function is based on ROUGE. While training it on a subset of the CNN dataset, the model loss seems to explode and the model outputs gibberish. Since I am new to this area, I needed some help understanding the problem. You can view the Wandb logs here.

Best,
Raj

lewtun · March 15, 2023, 2:03pm

Hi @RajSang could you please share a Colab notebook or a minimal example that reproduces your problem? That will help us better understand what’s going wrong

RajSang · March 16, 2023, 12:30am

Thanks for responding @lewtun , here is the colab notebook!

Topic		Replies	Views
Reproduce results on CNN/DailyMail Dataset Models	0	306	February 9, 2021
Reproducing bert2bert_cnn_dm result Models	0	234	June 30, 2021
PEGASUS model overfitting Research	2	463	May 19, 2021
```google/pegasus-cnn_dailymail``` generates blank files Models	0	303	April 15, 2021
VisionEncoderDecoder/TrOCR Models	0	702	October 21, 2021

TRL loss blowing up

Related topics