I have been fine-tuning a BERT model for sentence classification. During training I tokenized with padding="max_length", truncation=True, max_length=150, but at inference the model still predicts even when padding="max_length" is not passed.

Surprisingly, the predictions are identical in both cases, yet inference is much faster when padding="max_length" is omitted.

So I need some clarity on the "padding" parameter in the BERT tokenizer. How is BERT able to predict without padding, given that the sentence lengths differ, and are there any negative consequences of not passing padding="max_length" at inference time? Any help would be highly appreciated.
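For reference, here is a minimal plain-Python sketch of what I understand padding to do (the token ids and pad_id=0 are made up for illustration, not real BERT vocabulary ids):

```python
def pad_and_mask(token_ids, max_length, pad_id=0):
    """Pad a token-id sequence to max_length and build its attention mask."""
    ids = token_ids[:max_length]          # truncation=True: clip long inputs
    mask = [1] * len(ids)                 # 1 = real token, attended to
    pad_len = max_length - len(ids)
    ids = ids + [pad_id] * pad_len        # padding="max_length": fill with pad ids
    mask = mask + [0] * pad_len           # 0 = padding, ignored by attention
    return ids, mask

ids, mask = pad_and_mask([101, 7592, 2088, 102], max_length=8)
print(ids)   # [101, 7592, 2088, 102, 0, 0, 0, 0]
print(mask)  # [1, 1, 1, 1, 0, 0, 0, 0]
```

My understanding is that because the attention mask zeroes out the pad positions, the model's output for the real tokens should not change, which would explain why I see identical predictions, but I would appreciate confirmation of this.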