Model.save_pretrained() does not save layer changes

DuanXR · September 18, 2023, 9:55am

I trained a model based on BeitForImageClassification, and for some reason I needed to remove the classifier layer,I did it by calling this, and it worked fine.

pretrained_model.classifier = torch.nn.Identity()

And this is the current model structure:

========================================================================================================================
BeitForImageClassification                                             [1, 768]                  --
├─BeitModel: 1-1                                                       [1, 768]                  --
│    └─BeitEmbeddings: 2-1                                             [1, 197, 768]             768
│    │    └─BeitPatchEmbeddings: 3-1                                   [1, 196, 768]             590,592
│    │    └─Dropout: 3-2                                               [1, 197, 768]             --
│    └─BeitEncoder: 2-2                                                [1, 197, 768]             --
│    │    └─ModuleList: 3-3                                            --                        85,169,088
│    └─Identity: 2-3                                                   [1, 197, 768]             --
│    └─BeitPooler: 2-4                                                 [1, 768]                  --
│    │    └─LayerNorm: 3-4                                             [1, 768]                  1,536
├─Identity: 1-2                                                        [1, 768]                  --
========================================================================================================================

However it seems this change was not saved to the local model file with save_pretrain(), and when I called BeitForImageClassification.from_pretrained to read my saved model, the classifier went back to its previous state:

========================================================================================================================
BeitForImageClassification                                             [1, 589]                  --
├─BeitModel: 1-1                                                       [1, 768]                  --
│    └─BeitEmbeddings: 2-1                                             [1, 197, 768]             768
│    │    └─BeitPatchEmbeddings: 3-1                                   [1, 196, 768]             590,592
│    │    └─Dropout: 3-2                                               [1, 197, 768]             --
│    └─BeitEncoder: 2-2                                                [1, 197, 768]             --
│    │    └─ModuleList: 3-3                                            --                        85,169,088
│    └─Identity: 2-3                                                   [1, 197, 768]             --
│    └─BeitPooler: 2-4                                                 [1, 768]                  --
│    │    └─LayerNorm: 3-4                                             [1, 768]                  1,536
├─Linear: 1-2                                                          [1, 589]                  452,941
========================================================================================================================

Here’s my code:

new_model_path = model_path + "_new"
    pretrained_model = BeitForImageClassification.from_pretrained(model_path, ignore_mismatched_sizes=True)
    pretrained_model.classifier = torch.nn.Identity()
    summary(model=pretrained_model, input_size=(1, 3, 224, 224))
    pretrained_model.save_pretrained(new_model_path)

    new_model = BeitForImageClassification.from_pretrained(new_model_path, ignore_mismatched_sizes=True)
    summary(model=new_model, input_size=(1, 3, 224, 224))

Did I miss something?

Topic		Replies	Views
Trainer's `save_model` isn't saving the entire state_dict and is only saving the embedding/encoder Beginners	1	1501	January 2, 2024
Model.save_pretrained is not saving .bin files! model.push_to_hub is not pushing my model in my HuggingFace directory! What am I missing? Help Beginners	11	4085	February 25, 2025
Model loading and saving seems to change the model file 🤗Transformers	0	389	April 22, 2021
Saving local bert/roberta model not working using save_pretrained Beginners	0	1070	November 18, 2022
Save a Bert model with custom forward function and heads on Hugginface Intermediate	1	1969	June 7, 2022

Model.save_pretrained() does not save layer changes

Related topics