Fewer Trainable Parameters after Quantization

After some investigation, I think it might be because Linear4bit (https://github.com/TimDettmers/bitsandbytes/blob/main/bitsandbytes/nn/modules.py#L207) sets requires_grad=False on its weight, which removes a large number of parameters from the trainable count.
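
To illustrate what I mean, here is a minimal sketch (assuming bitsandbytes is installed; the layer sizes are arbitrary and the exact counts depend on version and bias settings) that compares the trainable-parameter count of a plain nn.Linear with a bnb.nn.Linear4bit of the same shape:

```python
import torch.nn as nn
import bitsandbytes as bnb

def count_trainable(module: nn.Module) -> int:
    """Count parameters that would receive gradients during training."""
    return sum(p.numel() for p in module.parameters() if p.requires_grad)

# A regular linear layer: both weight and bias are trainable.
fp_linear = nn.Linear(1024, 1024)
print("nn.Linear trainable params:", count_trainable(fp_linear))

# The 4-bit counterpart: if the weight is created with requires_grad=False,
# only the bias (if present) shows up as trainable.
q_linear = bnb.nn.Linear4bit(1024, 1024)
print("Linear4bit trainable params:", count_trainable(q_linear))

# Inspect each parameter to see where the difference comes from.
for name, p in q_linear.named_parameters():
    print(name, p.requires_grad, p.numel())
```

If this is the explanation, the reported trainable-parameter number after quantization would drop by roughly the size of all quantized weight matrices, leaving only biases and any non-quantized modules.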