I am using the T5-base model for abstractive summarization. The results are good, but the generated summaries contain newly introduced spelling mistakes that were not present in the input text.
Can anyone tell me why these spelling mistakes are occurring and how I can fix them?
I think it’s due to your minimum output length setting (e.g. `min_length` in Hugging Face `generate()`). For example, if you force the model to produce at least 50 tokens but its natural stopping point for a given input is around 40 tokens, it has to keep emitting something to reach the 50-token minimum, and those extra low-probability tokens can come out as garbled or misspelled words.
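To illustrate, here is a minimal sketch of how the length constraints could be loosened. This assumes the Hugging Face `transformers` library and the `t5-base` checkpoint from the question; the helper names (`build_generation_kwargs`, `summarize`) and the specific parameter values are illustrative, not taken from the original post.

```python
def build_generation_kwargs(min_length=10, max_length=100):
    """Generation settings that keep min_length low, so the model can
    stop at its natural end-of-sequence instead of padding the summary
    with filler tokens to satisfy a forced minimum (hypothetical values)."""
    return dict(
        min_length=min_length,     # low floor: don't force extra tokens
        max_length=max_length,
        num_beams=4,
        early_stopping=True,       # stop once all beams reach </s>
        no_repeat_ngram_size=3,
    )

def summarize(text, model_name="t5-base"):
    # Imports are local so the kwargs helper above loads even without
    # transformers installed.
    from transformers import T5ForConditionalGeneration, T5Tokenizer

    tokenizer = T5Tokenizer.from_pretrained(model_name)
    model = T5ForConditionalGeneration.from_pretrained(model_name)
    inputs = tokenizer("summarize: " + text,
                       return_tensors="pt", truncation=True)
    output_ids = model.generate(inputs.input_ids,
                                **build_generation_kwargs())
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

If lowering `min_length` removes the misspellings, that confirms the model was padding to hit the minimum; you can then raise it gradually to find a value that gives long enough summaries without filler.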