I’ve been training a Llama model with run_clm.py and ran into something I’m unsure about. After training, I found a zero_to_fp32.py script in the checkpoints folder. My question: do I actually need to run this script on my completed model, or can I just go ahead with the model.safetensors file, which seems to already hold the bf16 weights?
Appreciate any insights you guys might have!