Do I need to worry about this bert.pooler.dense training warning for my use case?

athiban2001 · March 25, 2022, 5:40am

I am currently using a modified SciBERT model that was fine-tuned with masked language modelling on a large corpus, with new tokens added to the vocabulary. I plan to use this model as an embedding layer for another model. When I load it, I get this warning:

Some weights of BertModel were not initialized from the model checkpoint at athiban2001/cord-scibert and are newly initialized: ['bert.pooler.dense.bias', 'bert.pooler.dense.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.

I am only going to use this BERT model as an embedding layer. Do I need to worry about this warning when extracting embeddings for sub-tokens?
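To make the question concrete, here is a minimal sketch of what I mean by extracting sub-token embeddings. It does not load my real checkpoint: it builds a tiny, randomly initialized BertModel (the config sizes and the token ids are made-up placeholders) so it runs without a download. As far as I understand, the pooler is only applied on top of the final hidden states to produce pooler_output, so re-initializing it should leave last_hidden_state untouched. The sketch checks exactly that.

```python
import torch
from transformers import BertConfig, BertModel

# Tiny randomly initialized BERT so the sketch runs offline.
# These sizes are placeholders, not the real cord-scibert configuration.
config = BertConfig(
    vocab_size=100,
    hidden_size=32,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=64,
)
torch.manual_seed(0)
model = BertModel(config)  # the pooler layer is included by default
model.eval()               # disable dropout so both forward passes are comparable

input_ids = torch.tensor([[1, 5, 7, 9, 2]])  # made-up sub-token ids

with torch.no_grad():
    before = model(input_ids=input_ids)
    # Re-randomize the pooler to mimic "newly initialized" pooler weights.
    torch.nn.init.normal_(model.pooler.dense.weight)
    torch.nn.init.normal_(model.pooler.dense.bias)
    after = model(input_ids=input_ids)

# Per-sub-token embeddings: shape (batch, seq_len, hidden_size).
token_embeddings = after.last_hidden_state

# True if the per-token outputs are unaffected by the pooler weights.
same_tokens = torch.equal(before.last_hidden_state, after.last_hidden_state)
# Compares the pooled outputs, which do pass through the pooler.
same_pooled = torch.equal(before.pooler_output, after.pooler_output)

print(token_embeddings.shape, same_tokens, same_pooled)
```

If that reading is right, only pooler_output depends on the weights named in the warning. I also believe BertModel accepts add_pooling_layer=False, which would skip creating the pooler altogether, but I have not checked whether that silences the warning for my checkpoint.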
