fine-tune Pegasus with xsum using Colab but generation results have no difference

harrywang · March 8, 2021, 8:45am

Hi. I tried to fine-tune pegasus large with xsum dataset using Colab (Pro). I was able to finish the fine-tuning with batch size 1, and 2000 epochs in about 40 minutes (larger batch size crashed colab). The working Colab notebook I used is shared at https://colab.research.google.com/drive/1RyUsYDAo6bA1RZICMb-FxYLszBcDY81X?usp=sharing

However, the generated summary seems to be the same for the pegasus large model (google/pegasus-large · Hugging Face) and the fine-tuned model. But the generated result using pegasus xsum model (google/pegasus-xsum · Hugging Face) is different and much better.

The training loss is already 0 and I am not sure what I have done wrong. Any help and pointers are highly appreciated.

Topic		Replies	Views
Finetuning Pegasus for summarization task 🤗Transformers	3	1046	October 14, 2020
Fine-tuning BigBirdPegasus Models	0	454	October 13, 2021
Pegasus finetuning, should we always start with pegasus-large? Beginners	5	1673	May 3, 2024
Google/pegasus-xsum for summerization is very slow Beginners	2	208	February 26, 2024
Fine-tuning Pegasus Models	33	10115	October 14, 2021