I faced the same issue when replicating Bigger, Better, Faster.
As far as I understand, weight initialization is only applied to a model before you train it; I didn't test whether it is possible to overwrite the initialization before training. You have to construct the specific layers you want to reinitialize yourself, initialize them, and then assign them back into the model, roughly like this:
model = AutoModelForCausalLM.from_config(config)
# for a LLaMA model in transformers the block class is LlamaDecoderLayer
# (recent versions also expect the layer index)
new_block = LlamaDecoderLayer(config, layer_idx=len(model.model.layers) - 1)
new_block.apply(model._init_weights)       # reuse the model's own init scheme
model.model.layers[-1] = new_block         # swap the fresh block in
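
In case it helps, here is a rough, self-contained sketch of the same idea for the BBF-style "reset the last blocks" case. It assumes a LLaMA-style model from transformers; the checkpoint name and the number of blocks to reset are placeholders, and LlamaDecoderLayer(config, layer_idx=...) matches recent transformers versions (older ones take only the config):

from transformers import AutoModelForCausalLM
from transformers.models.llama.modeling_llama import LlamaDecoderLayer

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")  # placeholder checkpoint
config = model.config

n_reset = 2                            # how many of the last blocks to reinitialize (placeholder)
n_layers = len(model.model.layers)

for idx in range(n_layers - n_reset, n_layers):
    # build a fresh block, apply the model's own weight init,
    # and match the device/dtype of the rest of the network
    fresh = LlamaDecoderLayer(config, layer_idx=idx)
    fresh.apply(model._init_weights)
    fresh = fresh.to(device=model.device, dtype=model.dtype)
    model.model.layers[idx] = fresh

After the swap you can train as usual: the fresh blocks start from their initialization while the rest of the model keeps its existing weights.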