Error saving quantized model

omoekan · June 29, 2022, 4:31pm

I get the following error trying to save a quantized model. Can anyone help? Thanks.

# quantize model

quantized_model = torch.quantization.quantize_dynamic(
model, {torch.nn.Linear}, dtype=torch.qint8
)
quantized_model.save_pretrained(quantized_dir)

AttributeError Traceback (most recent call last)
in ()
3 model, {torch.nn.Linear}, dtype=torch.qint8
4 )
----> 5 quantized_model.save_pretrained(quantized_dir)

1 frames
/usr/local/lib/python3.7/dist-packages/transformers/modeling_utils.py in shard_checkpoint(state_dict, max_shard_size)
291
292 for key, weight in state_dict.items():
--> 293 weight_size = weight.numel() * dtype_byte_size(weight.dtype)
294
295 # If this weight is going to tip up over the maximal size, we split.

AttributeError: 'torch.dtype' object has no attribute 'numel'
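
The traceback suggests that shard_checkpoint calls numel() on every value in the state dict, while the state dict of a dynamically quantized model can also hold non-tensor values (for example a torch.dtype stored alongside the packed Linear parameters). Below is a minimal sketch for listing such entries before saving. The helper name find_non_tensor_entries is hypothetical and not part of torch or transformers, and it uses duck typing (checking for a callable numel) rather than importing torch, which is an assumption made to keep the example self-contained.

```python
from typing import Any, Dict, Mapping


def find_non_tensor_entries(state_dict: Mapping[str, Any]) -> Dict[str, str]:
    """Return {key: type name} for every value without a callable numel().

    Hypothetical diagnostic helper. Values reported here are the ones that
    would fail on a line such as `weight.numel()` during checkpoint sharding.
    """
    return {
        key: type(value).__name__
        for key, value in state_dict.items()
        if not callable(getattr(value, "numel", None))
    }


# Intended usage (assumes `quantized_model` from the post above):
# print(find_non_tensor_entries(quantized_model.state_dict()))
```

If the result is non-empty, the listed keys are likely what save_pretrained trips over in this version of transformers.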

omoekan · June 30, 2022, 9:59pm

I found the solution here

rohanj · November 24, 2022, 9:59am

How did this issue get resolved? I am still getting the same error.

dquan · February 14, 2023, 3:31am

I'm getting the same error. I'm trying to save a quantized model created with the following line: torch.quantization.quantize_dynamic(model, {torch.nn.Linear}, dtype=torch.float16).

When running quantized_model.save_pretrained(path_dir), I run into the error AttributeError: 'torch.dtype' object has no attribute 'numel'.

Anyone have a solution?

dquan · February 16, 2023, 6:10pm

I was actually able to resolve the error by following the Pegasus Model Weights Compression/Pruning post above. Specifically, I was using PyTorch version 1.11.0 and Hugging Face Transformers version 4.20.1.
