Loading the Mdeberta-v3-base

Mei234543 · March 13, 2025, 10:47am

While loading the “microsoft/deberta-v3-base” model, I am getting unexpected warning message.

Code : model = AutoModelForTokenClassification.from_pretrained(“microsoft/deberta-v3-base”)

WARN : “Some weights of DebertaV2ForTokenClassification were not initialized from the model checkpoint at microsoft/deberta-v3-base and are newly initialized: [‘classifier.bias’, ‘classifier.weight’] You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.”

???: Why AM I seeing "DebertaV2ForTokenClassification " while loading V3 model.

Is the DEVs called the print warn function with different args?

John6666 · March 13, 2025, 5:18pm

There are a number of possible causes, but the model hasn’t been updated for about two years, so if there is a problem, it’s probably because it hasn’t been updated since then, or because of changes to the library since then. Well, I think you can ignore the warnings and it will still work…

github.com/huggingface/transformers

What to do about this warning message: "Some weights of the model checkpoint at bert-base-uncased were not used when initializing BertForSequenceClassification"

opened 01:31AM - 01 Jul 20 UTC

closed 06:37PM - 01 Jul 20 UTC

ohmeow

``` model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncas…ed") ``` returns this warning message: ``` Some weights of the model checkpoint at bert-base-uncased were not used when initializing BertForSequenceClassification: ['cls.predictions.bias', 'cls.predictions.transform.dense.weight', 'cls.predictions.transform.dense.bias', 'cls.predictions.decoder.weight', 'cls.seq_relationship.weight', 'cls.seq_relationship.bias', 'cls.predictions.transform.LayerNorm.weight', 'cls.predictions.transform.LayerNorm.bias'] - This IS expected if you are initializing BertForSequenceClassification from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPretraining model). - This IS NOT expected if you are initializing BertForSequenceClassification from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model). Some weights of BertForSequenceClassification were not initialized from the model checkpoint at bert-base-uncased and are newly initialized: ['classifier.weight', 'classifier.bias'] You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference. ``` This just started popping up with v.3 so I'm not sure what is the recommended action to take here. Please advise if you can. Basically, any of my code using the `AutoModelFor<X>` is throwing up this warning now. Thanks.

Mei234543 · March 13, 2025, 6:00pm

I am not sure you can see the jpeg i have attached.

the model config itself is pointing to V2.

I wondering how those 158 finetunes are using, is it V2 or V3?

if V3, it’s totally fine but if V2 those guys are doomed for real after realizing.

Mei234543 · March 13, 2025, 6:00pm

Is there a way to report this to devs?

John6666 · March 13, 2025, 6:05pm

Generally, if the person you are talking to is an individual, sending a mention with @+username is a good way to get their attention, but if the person you are talking to is an organization, mentions don’t work.
Also, since the person you are talking to is Microsoft and not Hugging Face, there is no way around it…

So, the most appropriate way to report this is to start a new discussion in the Discussion section.

John6666 · March 13, 2025, 6:08pm

However, there are cases where the code for the same model class can be reused, as is the case with Qwen 2 and 2.5. But V2 to V3 is probably a major version upgrade, so it’s probably still a bit strange.

Topic		Replies	Views
Uninitiallized weights with supposed correct architecture Models	1	330	October 6, 2023
DebertaForMaskedLM cannot load the parameters in the MLM head from microsoft/deberta-base Models	3	1324	April 29, 2022
Weights not downloading Beginners	3	1839	May 24, 2021
"Some weights were not used" message with AutoModel Beginners	4	1936	May 21, 2024
Is "Some weights of the model were not used" warning normal when pre-trained BERT only by MLM Beginners	6	18397	March 28, 2024

Loading the Mdeberta-v3-base

Related topics