I used this code to extract the weights and biases of the model:
from transformers import AutoModelForCausalLM
model_name = "meta-llama/Llama-3.2-3B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_name)
model_weights = model.state_dict()
weights = {}
biases = {}
for key, value in model_weights.items():
    if 'weight' in key:
        weights[key] = value
    else:
        biases[key] = value
print("Weights:", weights)
print("Biases:", biases)
However, the biases dictionary in the output is empty.
Is it intentional behaviour that this model does not have any biases?
Could someone please explain this? Biases are one of the fundamental parts of neural networks and can significantly boost performance. Why did Meta do this?
P.S. Just to be sure, I added this check for the length of weights:
print(len(weights.keys()), len(model_weights.keys()))
and the output:
255 255
Meaning there really are no biases in this model (is this what Meta means by models not having biases?)
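For what it's worth, the same split can be sketched without downloading the model, using a few hypothetical key names that follow Llama's state_dict naming scheme (the key names below are illustrative, not the full 255-key list). Checking the suffix with .endswith() is also slightly more precise than the substring test 'weight' in key:

```python
# Hypothetical subset of key names in Llama's state_dict naming scheme;
# the real model exposes 255 keys, all ending in ".weight".
keys = [
    "model.embed_tokens.weight",
    "model.layers.0.self_attn.q_proj.weight",
    "model.layers.0.mlp.gate_proj.weight",
    "model.layers.0.input_layernorm.weight",
    "model.norm.weight",
    "lm_head.weight",
]

# Split by exact suffix rather than by substring, so a key that merely
# *contains* "weight" or "bias" somewhere in its name is not misclassified.
weights = [k for k in keys if k.endswith(".weight")]
biases = [k for k in keys if k.endswith(".bias")]

print(len(weights), len(biases))  # → 6 0
```

Since every parameter key ends in ".weight" and none ends in ".bias", the biases dictionary comes out empty — consistent with the 255/255 count above.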