I fine-tuned CodeLlama using PEFT, but I added some custom tokens and a special padding token. So instead of the original vocab size of 32016, the adapter was trained with a slightly larger vocab of 32023. It seemed to work correctly after training. However, when I save it with `trainer.model.save_pretrained(...)` and reload it with `AutoPeftModelForCausalLM.from_pretrained(...)`, I get this error:
```
RuntimeError: Error(s) in loading state_dict for PeftModelForCausalLM:
    size mismatch for base_model.model.model.embed_tokens.modules_to_save.default.weight:
    copying a param with shape torch.Size([32023, 5120]) from checkpoint,
    the shape in current model is torch.Size([32016, 5120]).
```
hey @thenatefisher
In order to load a model where you have changed the token embeddings and LM head, you need to:

1. Add the embeddings as a LoRA layer so they are fine-tuned as well
2. Add the embeddings to the `modules_to_save` list in the LoRA config
3. Make sure the adapters are saved as checkpoints
4. Load the base model again and add the adapter to it; note that `model_id` is the saved adapter checkpoint
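Steps 1–3 might look like this in the LoRA config. This is a sketch, not the exact config from the original training run: the `r`/`lora_alpha` values and the `target_modules` list are assumptions you should adapt to your model.

```python
from peft import LoraConfig

lora_config = LoraConfig(
    r=16,                       # assumed rank, use your own
    lora_alpha=32,
    task_type="CAUSAL_LM",
    # attention projections get LoRA adapters (typical for Llama-family models)
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    # full copies of the resized embeddings and LM head are trained and
    # saved with the adapter, so the 32023-row matrices end up in the checkpoint
    modules_to_save=["embed_tokens", "lm_head"],
)
```

With `modules_to_save` set, the adapter checkpoint contains the full resized embedding and head weights, which is exactly why the base model must be resized before the adapter is loaded back.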
hope this helps!
```python
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Free memory before loading the base model and merging adapter weights
del model
del trainer
torch.cuda.empty_cache()

# Load the base model and resize its embeddings to match the tokenizer
model = AutoModelForCausalLM.from_pretrained(**model_params)
model.resize_token_embeddings(len(tokenizer))

# Attach the saved adapter to the resized base model
model = PeftModel.from_pretrained(model=model, model_id="sft_llm/checkpoint-30")
```
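The shape error from the question can be reproduced with a plain `nn.Embedding` standing in for the model (toy sizes replace the real 32016/32023 × 5120). Resizing before `load_state_dict` — which is essentially what `resize_token_embeddings` does for the whole model — makes the checkpoint fit:

```python
import torch
import torch.nn as nn

# Toy stand-ins for the real sizes (32016 -> 32023, hidden size 5120)
OLD_VOCAB, NEW_VOCAB, DIM = 16, 23, 4

# The checkpoint was saved with the extended vocab...
ckpt = nn.Embedding(NEW_VOCAB, DIM).state_dict()

# ...but a freshly loaded base model still has the original vocab
base = nn.Embedding(OLD_VOCAB, DIM)
try:
    base.load_state_dict(ckpt)  # 23 rows into 16 -> size mismatch
    mismatch = False
except RuntimeError:
    mismatch = True             # same class of error as in the question

# Fix: grow the embedding first (resize_token_embeddings keeps the old
# rows and freshly initializes the new ones)
resized = nn.Embedding(NEW_VOCAB, DIM)
with torch.no_grad():
    resized.weight[:OLD_VOCAB] = base.weight
resized.load_state_dict(ckpt)   # now the shapes line up

print(mismatch, tuple(resized.weight.shape))  # True (23, 4)
```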
```python
from trl import setup_chat_format
_, tokenizer = setup_chat_format(base_model, tokenizer)
```
Check `len(tokenizer)` before and after the above code block and see the difference. You can then resize the base model's token embeddings using the new tokenizer.
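As a toy illustration of why the count changes: adding special tokens appends new entries to the vocab, so the embedding matrix needs that many rows afterwards. The token names below are the usual ChatML specials, used here as an assumption, not necessarily the exact set `setup_chat_format` adds.

```python
# Stand-in vocab: maps token string -> id, like a tokenizer's vocab
vocab = {f"tok{i}": i for i in range(16)}

# Chat-template helpers add special tokens; each new one gets the next id
special_tokens = ["<|im_start|>", "<|im_end|>", "<pad>"]
added = 0
for tok in special_tokens:
    if tok not in vocab:
        vocab[tok] = len(vocab)
        added += 1

# The embedding matrix must now have len(vocab) rows, hence the call to
# model.resize_token_embeddings(len(tokenizer)) after the tokens are added
print(len(vocab), added)  # 19 3
```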