Get number of parameters for different parts of a model

AJHoeh · May 10, 2021, 5:47pm

Hey there,

I know I can get the number of trainable parameters in a pytorch model by using sum(p.numel() for p in model.parameters()), but how can I get the count for the different parts of the model? For example for BertForMaskedLM I tried using the code with model.base_model.parameters() and model.cls.parameters() but the sum of the results are way above the ressult for simply using model.parameters().

I am sure I must be missing something very obvious here but I dont know what.

EDIT: Ah, I figured the shared/cloned weights present both in the embedding layer and decoder are counted once for the model total since only one instance of them has effectively to be trained?

Best
Johannes

Topic		Replies	Views
Transformers module - parameter count and size 🤗Transformers	0	1219	January 12, 2024
Pretrained Model for Fine-Tuning has 100% Trainable Parameters 🤗Transformers	2	156	January 17, 2025
Difference in Number of Parameters for load_in_4bit Beginners	0	556	August 2, 2023
Less Trainable Parameters after quantization Intermediate	14	4459	May 2, 2024
InstructBLIP number of parameters Intermediate	0	279	August 18, 2023

Get number of parameters for different parts of a model

Related topics