I am running the summarization finetuning script from the latest master-branch version of examples/seq2seq on a custom dataset. However, the last sentence of some of the resulting summaries is truncated. The issue worsens as I increase my dataset size, with a greater proportion of the summaries ending up truncated. My parameters are as follows:
'--data_dir=.../data',
'--train_batch_size=1',
'--eval_batch_size=1',
'--output_dir=.../output',
'--num_train_epochs=5',
'--max_target_length=1024',
'--max_source_length=56',
'--model_name_or_path=facebook/bart-large'
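To rule out the data itself, I checked how many of my reference summaries exceed a given token budget. This is a rough sketch: it uses whitespace tokenization as an approximation of the real subword tokenizer (an assumption, so counts are only indicative), and assumes the examples/seq2seq data layout of one summary per line in a `train.target` file.

```python
from pathlib import Path


def count_over_budget(target_file: str, max_target_length: int) -> int:
    """Count reference summaries longer than max_target_length tokens.

    Whitespace split approximates the subword tokenizer; real token
    counts from the BART tokenizer will generally be higher.
    """
    over = 0
    for line in Path(target_file).read_text().splitlines():
        if len(line.split()) > max_target_length:
            over += 1
    return over
```

For example, `count_over_budget("train.target", 1024)` reports how many targets would be cut off at my `--max_target_length` setting under this approximation.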
Here is a very small dataset (500 training instances) with which I was able to reproduce the issue.
Is this expected? Any insights would be helpful. Thank you!