Nucleus Sampling copying input

Skylixia · April 28, 2021, 9:39pm

Hi,

I trained the mBART by using different decoding methods for summarization. I found that nucleus sampling ends up just copying entire sentences from the input … My training data is not at all extractive and the other decoding methods didnt do this. Is this a common behavior with nucleus sampling ? I understand the decoder sample from a cumulative distribution above p but how can it be that it would reproduce exact sequences from the input ?

Thanks in advance for your input

Topic		Replies	Views
Using nucleus sampling and temperature at the same time Models	0	431	June 27, 2023
Help with finetuning mBART on an unseen language Models	19	2057	October 30, 2020
MBart Zero Shot Transfer Learning Beginners	0	350	June 4, 2021
Slow inference while performing translation Intermediate	0	604	June 10, 2022
'T5' generates almost the same input! Beginners	1	375	September 21, 2022

Nucleus Sampling copying input

Related topics