safetensors_rust.SafetensError while deserializing header: InvalidHeaderDeserialization

arvind-s · January 11, 2024, 6:56pm

Hi:
I am trying to load a checkpoint after fine-tuning a codeLlama model using PEFT.
This gives me an error:
File “/home/ubuntu/.cache/pypoetry/virtualenvs/llama-VNDZLnVB-py3.8/lib/python3.8/site-packages/safetensors/torch.py”, line 308, in load_file
with safe_open(filename, framework=“pt”, device=device) as f:
safetensors_rust.SafetensorError: Error while deserializing header: InvalidHeaderDeserialization

Here is my rather simple code for loading:
base_model = “codellama/CodeLlama-7b-hf”
model = AutoModelForCausalLM.from_pretrained(
base_model,
load_in_4bit=True,
torch_dtype=torch.float16,
device_map=“auto”,
)
tokenizer = AutoTokenizer.from_pretrained(“codellama/CodeLlama-7b-hf”)
output_dir = “fine-tuned-code-llama/checkpoint-400”
model = PeftModel.from_pretrained(model, output_dir)

The model was fine-tuned in 4-bit mode. I am wondering if this is a bug loading safetensors or something I am doing incorrectly.
Thank you!

Sneekit · February 23, 2024, 1:53pm

github.com/slai-labs/get-beam

Fine-tuning example doesn't work for custom datasets

opened 07:22AM - 27 Nov 23 UTC

closed 04:14PM - 11 Jan 24 UTC

vmatekole

The fine-tuning [script](https://github.com/slai-labs/get-beam/blob/main/example…s/finetune-llama/training.py) provided does not work with custom datasets. It appears there is a [bug](https://github.com/huggingface/transformers/issues/27397) related to PEFT. The [InvalidHeaderDeserialization](https://github.com/huggingface/transformers/issues/27397) message has been reported by other users. Strangely, the recommended changes of not calling torch.compile didn’t work for me. However, after looking at this [comment](https://github.com/huggingface/transformers/issues/27397#issuecomment-1804313865) I removed the following [code](https://github.com/slai-labs/get-beam/blob/354e856c513394f551440d1d73c8cc298d1c0826/examples/finetune-llama/training.py#L294-L296) and training completed successfully.

There is currently an issue in the compatibility between PEFT and Pytorch while using custom data training sets.

I encountered this issue due to the following optimizations before training. Removing these optimizations allowed my models to be correctly saved and loaded, however it did increase the VRAM requirements of training.

model.state_dict = (
lambda self, *_, **__: get_peft_model_state_dict(
self, old_state_dict()
)
).get(model, type(model))

if torch.__version__ >= "2" and sys.platform != "win32":
    model = torch.compile(model)

Topic		Replies	Views
safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooSmall 🤗Hub	1	1697	September 5, 2024
Llama2-70b SafetensorError: Error while deserializing header: HeaderTooLarge 🤗Transformers	0	1188	December 9, 2023
Resume from checkpoint Beginners	0	325	November 30, 2023
Handling Peft Model the right way (save, load, inference) 🤗Transformers	0	142	August 10, 2024
Safetensors format issue 🤗Transformers	2	1884	November 27, 2023

safetensors_rust.SafetensError while deserializing header: InvalidHeaderDeserialization

Related topics