Thanks for your reply. The BERT model is part of my whole model, so I directly saved the whole model. What is the best practice in this situation?
Would it be possible for you to create a Colab for this?
Also, to take advantage of .from_pretrained and .save_pretrained, you can subclass BertPreTrainedModel and add the additional layers in it. See the task-specific BERT models: they use BERT plus additional layers on top of it and subclass BertPreTrainedModel.
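A minimal sketch of what that subclassing could look like. The class name `BertWithClassifier` and the two-label linear head are illustrative assumptions, not part of the library; the pattern mirrors how models like BertForSequenceClassification are built:

```python
import torch
from transformers import BertModel, BertPreTrainedModel

class BertWithClassifier(BertPreTrainedModel):
    """Hypothetical example: BERT encoder plus one extra linear head."""

    def __init__(self, config):
        super().__init__(config)
        self.bert = BertModel(config)
        # Extra layer on top of BERT (2 output labels here, as an example)
        self.classifier = torch.nn.Linear(config.hidden_size, 2)
        # Initialize weights the same way the library's models do
        self.init_weights()

    def forward(self, input_ids, attention_mask=None):
        outputs = self.bert(input_ids, attention_mask=attention_mask)
        pooled = outputs.pooler_output  # pooled [CLS] representation
        return self.classifier(pooled)
```

With this setup, `BertWithClassifier.from_pretrained("bert-base-uncased")` loads the pretrained encoder weights and randomly initializes the new head, while `model.save_pretrained(path)` followed by `BertWithClassifier.from_pretrained(path)` round-trips the whole model, including the extra layer.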