Fill-mask with BART for variable-length predictions

Hey All

I have tried using BART with the fill-mask pipeline to predict masked tokens, but the correct fill is sometimes more than one word, and the pipeline does not have an option for that.

In the documentation, I found this example of mask filling with BART:

from transformers import BartForConditionalGeneration, BartTokenizer

model = BartForConditionalGeneration.from_pretrained("facebook/bart-large", forced_bos_token_id=0)
tok = BartTokenizer.from_pretrained("facebook/bart-large")
example_english_phrase = "UN Chief Says There Is No <mask> in Syria"
batch = tok(example_english_phrase, return_tensors="pt")
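# generate() fills the <mask> span, but by default it returns only the single best sequence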
generated_ids = model.generate(batch["input_ids"])
assert tok.batch_decode(generated_ids, skip_special_tokens=True) == [
    "UN Chief Says There Is No Plan to Stop Chemical Weapons in Syria"
]

My question is: how can I use this to obtain the top 5 predictions? With the fill-mask pipeline, this would be equivalent to

unmasker = pipeline("fill-mask", model=model_name, tokenizer=tokenizer, top_k=10)

The final output I am hoping to obtain is:

["plan to stop the war", .. etc another 5 predictions ]

@sgugger would you be able to lend a hand?