Fewer Trainable Parameters after quantization

Lastly, I think we need to be careful about what significance we give to seeing the number of trainable parameters go down. You can arbitrarily reduce the number of “trainable parameters” in a model simply by freezing parts of it (you just set requires_grad = False on a weight matrix), and I think that’s all that’s happening here.
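
To make that concrete, here's a minimal sketch (the toy model and count_trainable helper are just illustrative, not from any particular codebase) showing that freezing a layer drops the reported trainable-parameter count without changing anything meaningful about the training setup:

```python
import torch.nn as nn

# A hypothetical toy model, only to illustrate the counting.
model = nn.Sequential(
    nn.Linear(512, 512),
    nn.ReLU(),
    nn.Linear(512, 512),
)

def count_trainable(m: nn.Module) -> int:
    return sum(p.numel() for p in m.parameters() if p.requires_grad)

print(count_trainable(model))  # every parameter is trainable by default

# "Reduce" the trainable-parameter count by freezing the first layer.
for p in model[0].parameters():
    p.requires_grad = False

print(count_trainable(model))  # the count drops, but nothing efficient happened
```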

Don’t conflate that with “parameter-efficient fine-tuning” techniques like LoRA, where you get to train fewer parameters while still getting a similar effect to training all of them.
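
For contrast, here's a bare-bones sketch of the LoRA idea (class and attribute names like LoRALinear, lora_A, lora_B are my own illustrative choices, not the reference implementation or any library's API): the base weight is frozen just like above, but a small low-rank update is trained in its place, so the few trainable parameters actually stand in for a full-rank update:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Sketch of LoRA: freeze the base weight and learn a low-rank
    update scaled by alpha / r. Defaults here are illustrative."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # frozen, as in the previous sketch
        # Only these two small matrices are trained.
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Frozen base output plus the learned low-rank correction.
        return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling

layer = LoRALinear(nn.Linear(512, 512))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
total = sum(p.numel() for p in layer.parameters())
print(f"trainable: {trainable} / {total}")  # far fewer, yet the layer still adapts
```

The difference from simple freezing is that the small trainable matrices are placed so their product approximates the weight update full fine-tuning would have made, which is where the “efficiency” in parameter-efficient fine-tuning comes from.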