I have a custom model that adheres to the Hugging Face spec, so I can use save_pretrained
and from_pretrained
to correctly save and load models from local directories.
After pre-training the model I want to fine-tune it with extra layers: all the original layers stay frozen and new layers are introduced for the fine-tuning task. When I save this new model I want to save only the unfrozen layers to reduce disk usage, and then when loading it I want to load the original pre-trained layers from the first checkpoint and the new layers from the second checkpoint. Is this at all possible using the functions provided by the transformers
library? This is similar to loading adapters from a PEFT model, except that I am not using the PEFT library, and these are not really adapters: they are entirely new components that are present in the fine-tuned model class but absent from the pre-trained model class.
I want to be able to do something like this:
modelA = MyCausalModel()
train( modelA )
modelA.save_pretrained( '/modelA' )
...
modelB = MyCausalModelWithHead.from_pretrained( '/modelA' )
modelB.backbone.requires_grad_( False )
modelB.new_layers.requires_grad_( True )
train( modelB )
modelB.save_pretrained( '/modelB', only_new_layers=True )
...
restored_modelB = MyCausalModelWithHead.from_pretrained( '/modelA', '/modelB' )
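For context, here is the kind of workaround I imagine if there is no built-in support. It is a minimal sketch in plain PyTorch rather than transformers, and Backbone, ModelWithHead, save_trainable_only, and load_from_two_checkpoints are hypothetical stand-ins for my real classes and helpers:

```python
import torch
import torch.nn as nn

class Backbone(nn.Module):
    """Stand-in for the pre-trained model (MyCausalModel)."""
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 4)

    def forward(self, x):
        return self.linear(x)

class ModelWithHead(nn.Module):
    """Stand-in for the fine-tuned model (MyCausalModelWithHead)."""
    def __init__(self):
        super().__init__()
        self.backbone = Backbone()
        self.head = nn.Linear(4, 2)  # the "new layers"

    def forward(self, x):
        return self.head(self.backbone(x))

def save_trainable_only(model, path):
    # Keep only the parameters that still require gradients, i.e. the
    # unfrozen layers, and save just that partial state dict.
    trainable = {name for name, p in model.named_parameters() if p.requires_grad}
    state = {k: v for k, v in model.state_dict().items() if k in trainable}
    torch.save(state, path)
    return state

def load_from_two_checkpoints(model, backbone_path, head_path):
    # Load the frozen backbone weights from the first checkpoint, then
    # overlay the new layers from the second; strict=False tolerates the
    # keys that are missing from the partial head checkpoint.
    model.backbone.load_state_dict(torch.load(backbone_path))
    model.load_state_dict(torch.load(head_path), strict=False)
    return model
```

I believe save_pretrained accepts a state_dict argument, so the filtering half could presumably be routed through it; what I can't see is whether from_pretrained can merge two checkpoint directories like this natively.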