Bug Report: Mask token mismatch with the model on hosted inference API of Model Hub

On my model card, the hosted inference widget used to run successfully, but it recently started returning an error: "<mask>" must be present in your input.

My model uses a RoBERTa MLM with a BERT tokenizer, so the mask token is actually “[MASK]”. I have already set it in tokenizer_config.json, but the inference API still does not pick it up.
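
For reference, a minimal local check (assuming the transformers library and my repo id, ethanyt/guwenbert-base) that the uploaded tokenizer files really do advertise “[MASK]”:

```python
# Minimal sketch: load the tokenizer from the Hub and confirm the
# mask token recorded in the tokenizer files.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("ethanyt/guwenbert-base")
print(tokenizer.mask_token)     # prints: [MASK]
print(tokenizer.mask_token_id)  # the corresponding vocab id
```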

This used to work, but recently it started raising the error above; it seems the front-end now double-checks the mask token. How can I set the mask token in an appropriate way? Is there documentation on setting the mask token for the inference API?
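
In case it helps, this is a sketch of how I set the token (the output directory name is just an example); save_pretrained regenerates tokenizer_config.json and special_tokens_map.json, which I would expect the hosted widget to read:

```python
# Sketch: explicitly set "[MASK]" as the mask token and regenerate the
# tokenizer files that get uploaded to the Hub.
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("ethanyt/guwenbert-base")
tokenizer.mask_token = "[MASK]"                # BERT's default mask token
tokenizer.save_pretrained("./guwenbert-base")  # writes tokenizer_config.json
```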

Thanks!

To reproduce

Steps to reproduce the behavior:

  1. Go to ethanyt/guwenbert-base · Hugging Face
  2. Run an example containing “[MASK]” (an equivalent raw API call is sketched below)
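
To reproduce outside the widget, here is a sketch of the equivalent fill-mask request against the hosted Inference API (it assumes the requests library, a valid API token in the HF_TOKEN environment variable, and the example sentence from my README):

```python
# Sketch: send a fill-mask input containing "[MASK]" to the hosted
# Inference API, mirroring what the model-card widget submits.
import os
import requests

API_URL = "https://api-inference.huggingface.co/models/ethanyt/guwenbert-base"
headers = {"Authorization": f"Bearer {os.environ['HF_TOKEN']}"}

payload = {"inputs": "[MASK]太元中，武陵人捕鱼为业。"}  # example input with “[MASK]”
response = requests.post(API_URL, headers=headers, json=payload)
print(response.json())
```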

Expected behavior

It used to work; see the snapshot in guwenbert/README_EN.md at main · Ethan-yt/guwenbert · GitHub

This should be resolved in Mask token mismatch with the model on hosted inference API of Model Hub · Issue #11884 · huggingface/transformers · GitHub; if possible, please do not open duplicate issues/forum posts. Thanks!