It is not clear to me what the correct way is to save/load a PEFT checkpoint, as well as the final fine-tuned model. There have been reports of trainer.resume_from_checkpoint not working as expected [1][2][3], each of which has very few replies or no real consensus. Proposed solutions range from trainer.save_model, to trainer.save_state, to resume_from_checkpoint=True, to model.save_pretrained (PEFT docs), to an even more complicated procedure of merging and saving the model [4].
It is very confusing trying to figure out which of these is correct, especially if resume_from_checkpoint can be buggy. Loading/saving models really should not be this confusing, so can we settle once and for all what the officially recommended (and tested) way is to save/load adapters, as well as individual checkpoints during training? Can we update the HF docs accordingly and simplify this process?
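To make the ambiguity concrete, these are the kinds of snippets the various threads suggest. This is just a sketch, with trainer and model coming from a standard Trainer/PEFT setup, and the paths as placeholders:

# Option 1: save/resume through the Trainer
trainer.save_model("outputs/final_adapter")  # suggested in some threads
trainer.save_state()                         # suggested in others
trainer.train(resume_from_checkpoint=True)   # reportedly buggy with PEFT [1][2][3]

# Option 2: save the adapter directly, per the PEFT docs
model.save_pretrained("outputs/final_adapter")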
Hi @remorax98
Unfortunately, Sylvain is no longer a part of Hugging Face (just wanted to clarify this, since lots of people keep tagging him).
Also, for loading your PEFT model for continued training, there is a very easy parameter for this called is_trainable, which lets you load your PEFT model in a trainable state; then you can continue your training easily. Here is how to use it:
For my repo not-lain/Gemma-2b-Peft-finetuning, all I have to do is:
# most of this code is from the button at the top right corner on 🤗
from peft import PeftModel, PeftConfig
from transformers import AutoModelForCausalLM
config = PeftConfig.from_pretrained("not-lain/Gemma-2b-Peft-finetuning")
model = AutoModelForCausalLM.from_pretrained("google/gemma-2b")
model = PeftModel.from_pretrained(
    model,
    "not-lain/Gemma-2b-Peft-finetuning",
    is_trainable=True,  # 👈 here
)
# check if it's working
model.print_trainable_parameters()
# >>> trainable params: 9,805,824 || all params: 2,515,978,240 || trainable%: 0.3897420034920493
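After that you can hand the model to a regular Trainer to continue training. A rough sketch (the dataset and hyperparameters below are placeholders you would replace with your own):

from transformers import Trainer, TrainingArguments

training_args = TrainingArguments(
    output_dir="gemma-2b-peft-continued",  # checkpoints land here
    per_device_train_batch_size=1,
    num_train_epochs=1,
)

trainer = Trainer(
    model=model,                  # the PeftModel loaded with is_trainable=True
    args=training_args,
    train_dataset=train_dataset,  # placeholder: your tokenized dataset
)
trainer.train()

# Since model is a PeftModel, save_pretrained writes only the adapter weights
trainer.model.save_pretrained("gemma-2b-peft-continued/final_adapter")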
Heart-react this comment if it solved the problem for you.
Hi @not-lain thanks for letting me know about Sylvain! Sorry, my bad.
Regarding this parameter: it is good to know, and quite useful for me. But it does not answer my specific question. I am more concerned with the proper way of saving adapters and checkpoints in HF, and the lack of clarity in the documentation regarding the same.
Don't mention it @remorax98.
I also made this notebook for you explaining all the steps, from the initial training to reloading the model and continuing training. It took me a lot of time to get it done, but I hope it helps you out.
If this notebook helped you clarify how to use PEFT, please consider marking this conversation as solved.
Guys, can you help with proper examples of storing it to the file system instead of pushing to the Hub? I tried
trainer.model.save_pretrained("shake_adapter")
and then…
@adiudiun, I had the same problem. The correct way is to first load the adapter config using PeftConfig.from_pretrained('saved_dir') (which tells you the base model), then load the base model using AutoModel.from_pretrained(), and finally load the peft_model as follows:
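For example, reusing the "shake_adapter" directory from the post above (I am using AutoModelForCausalLM on the assumption that the base model is a causal LM):

from transformers import AutoModelForCausalLM
from peft import PeftConfig, PeftModel

adapter_dir = "shake_adapter"  # the local directory you passed to save_pretrained

# 1. Read the adapter config to find the base model it was trained on
config = PeftConfig.from_pretrained(adapter_dir)

# 2. Load that base model
base_model = AutoModelForCausalLM.from_pretrained(config.base_model_name_or_path)

# 3. Attach the adapter weights from the local directory
model = PeftModel.from_pretrained(
    base_model,
    adapter_dir,
    is_trainable=True,  # only needed if you plan to keep training the adapter
)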
from peft import PeftModel, PeftConfig
from unsloth import FastLanguageModel
import torch
max_seq_length = 4096 # Can increase for longer reasoning traces
lora_rank = 32 # Larger rank = smarter, but slower
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "unsloth/Qwen3-4B-Instruct-2507",
    max_seq_length = max_seq_length,
    load_in_4bit = True,  # False for LoRA 16bit
    # fast_inference = True,          # Enable vLLM fast inference
    # max_lora_rank = lora_rank,
    # gpu_memory_utilization = 0.7,   # Reduce if out of memory
)
model = PeftModel.from_pretrained(
    model,
    "/kaggle/input/qwen3-4b-instruct-lora/Qwen3_(4B)-Instruct_lora_model",
    is_trainable=True,  # here
)
âŚ
RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn