Infilling multiple mask spans with BartForConditionalGeneration

jbmaxwell · July 12, 2022, 6:24pm

I’m using BartForConditionalGeneration for sentence infilling. It works for a single <mask> token, but I can’t get it to work when the input contains multiple masks.

The problem seems to be that once it starts generating, it doesn’t “return” to the original sequence; it keeps generating until it hits a stopping condition. It often reproduces the first 2 or 3 tokens that follow the <mask>, but soon after that it diverges again. In other words, it’s clearly not copying the original tokens through, but actively generating and coincidentally replicating the original context after the mask (which is to be expected, since the generation is conditioned on that context).

I’m guessing this has something to do with how generate() works for this model, but I’m wondering if there’s a way around it? That is, is there a way to alternate dynamically between actively generating infilling tokens and passing through the unmasked tokens from the original input as context?

The only other solution I can think of is to run generate() multiple times, each with only a single <mask> token in the input, and then combine the results once they’re all done. Obviously this would be slower, but I’d expect it to work.
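
To make that fallback concrete, here is a rough sketch of what I have in mind. It is untested against the real model and rests on several assumptions: the helper names (`infill_one_mask_at_a_time`, `extract_infill`, `make_bart_filler`) are hypothetical, the `facebook/bart-large` checkpoint and the `num_beams`/`max_length` values are just illustrative choices, masks are assumed to be separated from their neighbours by whitespace, and each call only sees the right-hand context up to the next mask. Since nothing guarantees that BART leaves the unmasked tokens alone, the sketch raises an error when the decoded output no longer starts and ends with the original context.

```python
from typing import Callable

MASK = "<mask>"


def extract_infill(decoded: str, left: str, right: str) -> str:
    """Recover the generated span by stripping the known left/right context.

    Raises ValueError if the model altered the unmasked context, since the
    span cannot then be located reliably.
    """
    d, l, r = decoded.strip(), left.strip(), right.strip()
    if d.startswith(l) and d.endswith(r) and len(d) >= len(l) + len(r):
        return d[len(l): len(d) - len(r)].strip()
    raise ValueError("model output does not preserve the unmasked context")


def infill_one_mask_at_a_time(text: str, fill_one: Callable[[str, str], str]) -> str:
    """Fill each <mask> left to right, making one fill_one call per mask.

    fill_one(left, right) must return only the text that replaces the mask.
    """
    segments = text.split(MASK)
    result = segments[0]
    for right in segments[1:]:
        # Left context: everything resolved so far.
        # Right context: original text up to (not including) the next mask.
        result += fill_one(result, right) + right
    return result


def make_bart_filler(model_name: str = "facebook/bart-large") -> Callable[[str, str], str]:
    """Build a fill_one callable backed by BART (assumed checkpoint name)."""
    # Imported lazily so the pure-Python helpers above work without transformers.
    from transformers import BartForConditionalGeneration, BartTokenizer

    tokenizer = BartTokenizer.from_pretrained(model_name)
    model = BartForConditionalGeneration.from_pretrained(model_name)

    def fill_one(left: str, right: str) -> str:
        # Exactly one <mask> per call, as described above.
        batch = tokenizer(left + MASK + right, return_tensors="pt")
        output_ids = model.generate(batch["input_ids"], num_beams=4, max_length=128)
        decoded = tokenizer.batch_decode(output_ids, skip_special_tokens=True)[0]
        return extract_infill(decoded, left, right)

    return fill_one
```

Usage would be something like `infill_one_mask_at_a_time("The <mask> sat on the <mask>.", make_bart_filler())`. Cutting the right context at the next mask keeps every call single-mask, at the cost of hiding later context from earlier fills.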

Any help or thoughts very much appreciated.
