Hi, I have quantized the flan-t5-base model to int8, and when I try to upload it to the Hugging Face Hub I keep getting this error:
```
ValueError: The model is quantized with QuantizationMethod.QUANTO and is not serializable - check out the warnings from the logger on the traceback to understand the reason why the quantized model is not serializable.
```
Here's the code:

```python
from transformers import T5Tokenizer, T5ForConditionalGeneration, QuantoConfig

model_id = "google/flan-t5-base"
quantization_config = QuantoConfig(weights="int8")
quantized_model = T5ForConditionalGeneration.from_pretrained(
    model_id,
    low_cpu_mem_usage=True,
    quantization_config=quantization_config,
)
quantized_model.push_to_hub("flan-t5-base-8bit")  # raises the ValueError above
```