So I think I found a solution to this, but if anyone has more info on this topic please lmk! After the first round of training, save the fine-tuned adapter. Then reload the base model, wrap it with `PeftModel.from_pretrained()`, and call `merge_and_unload()` to merge the adapter into the base model. Then save the merged model. The saved merged model will be the size of the base model, with the fine-tuned layers baked in. To fine-tune further, load the merged model and train it as if it were the base model.
```python
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Save the fine-tuned adapter (adapter weights only, so it's small)
trainer_filepath = f"trainer/llama7b/{train_util.get_time()}"
trainer.model.save_pretrained(trainer_filepath)

# Reload the base model
base_model = AutoModelForCausalLM.from_pretrained(model_name, token=huggingface_token)

# Merge the base model and the fine-tuned adapter
merged_model = PeftModel.from_pretrained(base_model, trainer_filepath)
merged_model = merged_model.merge_and_unload()

# Save the merged model (full base-model size, adapter weights baked in)
merged_model_path = f"model/llama7b/merged_{train_util.get_time()}"
merged_model.save_pretrained(merged_model_path)
```
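For the next round, here's a rough sketch of what loading the merged checkpoint as the new base and attaching a fresh LoRA adapter could look like. The `LoraConfig` hyperparameters below are just placeholder assumptions, not values from my actual run:

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Load the merged checkpoint as if it were the base model
next_base = AutoModelForCausalLM.from_pretrained(merged_model_path)

# Attach a fresh LoRA adapter for the next round of fine-tuning
# (these hyperparameters are placeholders -- use your own config)
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
next_model = get_peft_model(next_base, lora_config)

# ...then build a Trainer around next_model and train as before
```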
If there’s a better solution to this problem, please lmk!