Why is the average length of generated summaries during Hugging Face text summarization training much shorter than the average length of the targets in the training data?

Hello,
I’m fine-tuning a Hugging Face text generation model on my own dataset for text summarization, and I’m noticing that the average length of the summaries generated during training is much shorter than the average length of the target summaries in the training data.

Here are some details about my setup:

  • I’m using the Seq2SeqTrainer API from Hugging Face’s Transformers library to fine-tune the google/flan-t5-small model for text summarization (a sketch of my setup follows this list).
  • The average length of the target output text in my training data is around 270 words, while the average length of the generated summaries is only around 20 words.
  • The final rouge1 score I get after training for 20 epochs is 25.
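For context, here is a minimal sketch of my setup. The dataset files, column names (`document`, `summary`), and tokenization lengths below are simplified placeholders rather than my exact code, and the ROUGE `compute_metrics` function is omitted for brevity:

```python
from datasets import load_dataset
from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

model_name = "google/flan-t5-small"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# Placeholder files and column names; my real targets average around 270 words.
dataset = load_dataset("csv", data_files={"train": "train.csv", "validation": "val.csv"})

def preprocess(batch):
    model_inputs = tokenizer(batch["document"], max_length=1024, truncation=True)
    labels = tokenizer(text_target=batch["summary"], max_length=512, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized = dataset.map(
    preprocess, batched=True, remove_columns=dataset["train"].column_names
)

training_args = Seq2SeqTrainingArguments(
    output_dir="flan-t5-small-summarization",
    num_train_epochs=20,
    evaluation_strategy="epoch",
    predict_with_generate=True,  # summaries are generated during evaluation
)

trainer = Seq2SeqTrainer(
    model=model,
    args=training_args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    tokenizer=tokenizer,
    # compute_metrics (rouge) omitted here for brevity
)
trainer.train()
```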

In this context, I would like to ask several questions:

  1. Why is the model generating summaries that average only about 20 words when the targets in my training data average around 270 words?
  2. How can I force the model to generate longer summaries during training? (See the sketch after this list for the settings I mean.)
  3. I believe that if the model generates longer summaries, the rouge1 score will also increase. Please share your feedback on this assumption as well.
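To make question 2 concrete: I suspect the output length is being capped by the generation settings used when predict_with_generate=True (I believe the default generation max_length is only 20 tokens, which would match what I’m seeing). Here is a sketch of the settings I mean; the values are illustrative, not something I have verified fixes the issue:

```python
from transformers import AutoModelForSeq2SeqLM, Seq2SeqTrainingArguments

model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-small")

# Cap applied by Seq2SeqTrainer when it generates during evaluation/prediction
# (only used together with predict_with_generate=True).
training_args = Seq2SeqTrainingArguments(
    output_dir="flan-t5-small-summarization",
    predict_with_generate=True,
    generation_max_length=512,
)

# Generation defaults attached to the model itself, used by model.generate().
model.generation_config.max_length = 512      # raise the upper bound on generated tokens
model.generation_config.min_length = 200      # force a minimum number of generated tokens
model.generation_config.num_beams = 4
model.generation_config.length_penalty = 2.0  # >1.0 with beam search tends to favor longer outputs
```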

Thank you.

Just curious, were you able to resolve or understand this? If so, how?