Can top_k be used with k=len(vocab)?

anon58275033 · August 7, 2021, 12:11pm

Hi,

When viewing the top predicted tokens in masked language modelling (MLM), is it possible to use top_k with k=len(vocab)?

So far, I have used the following line of code:

mask_filler("The capital of [MASK] is Paris", top_k=5)

Can top_k instead be set to len(vocab), so that every token in my vocabulary is returned as a ranked prediction?
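To make the question concrete, here is a minimal sketch of the behaviour I have in mind. It does not call the pipeline at all: the vocabulary, the logits, and the helper `top_k_tokens` are made up for illustration, and I am only assuming that top-k selection amounts to a softmax over the vocabulary followed by taking the k best entries.

```python
import math

# Hypothetical toy vocabulary and logits for the [MASK] position.
# These are illustrative values, not real model output.
vocab = ["France", "Germany", "Italy", "Spain", "Belgium", "Paris"]
logits = [4.0, 1.5, 1.0, 0.5, 0.0, -2.0]


def top_k_tokens(vocab, logits, k):
    """Return the k highest-probability (token, probability) pairs."""
    if not 1 <= k <= len(vocab):
        raise ValueError("k must be between 1 and len(vocab)")
    # Softmax, subtracting the max logit for numerical stability.
    largest = max(logits)
    exps = [math.exp(x - largest) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Sort tokens by probability, highest first, and keep the first k.
    ranked = sorted(zip(vocab, probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


# A short list, analogous to top_k=5 in the call above (here k=3).
top3 = top_k_tokens(vocab, logits, k=3)

# k equal to the vocabulary size: a full ranking of every token.
full_ranking = top_k_tokens(vocab, logits, k=len(vocab))

for token, prob in full_ranking:
    print(f"{token}: {prob:.4f}")
```

In this toy version, k=len(vocab) simply yields every token exactly once, ordered by probability, and the probabilities sum to 1. Whether the pipeline's top_k behaves the same way is what I am asking.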

Thanks!
