Int8 quantization working, int4 and float8 not

shakedregev · April 1, 2024, 10:10pm

Following the example in the Readme, tried to run this:

        quanto.quantize(self.model, weights=quanto.qfloat8, activations=quanto.qfloat8)
        quanto.freeze(self.model)

and got (the error occurs later in the code, this line runs fine)

HuggingFace error: expected mat1 and mat2 to have the same dtype, but got: float != c10::Half

The same failure and error occurs when trying to use int4. This same line works with int8, i.e.

        quanto.quantize(self.model, weights=quanto.qint8, activations=quanto.qint8)

I’m not entirely sure if the failure is in my code or within the quanto library. I did successfully run the example within quanto when I changed it to float8. But I thought this error might be happening because there are types that don’t exist in pytorch, whereas int8 does.

Topic		Replies	Views
Looks like the new transformer 4.49.0 has some issues 🤗Transformers	3	234	March 6, 2025
Usage of calibrate() from quanto Beginners	0	178	April 2, 2024
The quantization code in the "Gentle Introduction to 8-bit Matrix Multiplication for transformers" blog post yields error 🤗Transformers	1	724	May 29, 2023
HugginFace dataset error: RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.HalfTensor) should be the same or input should be a MKLDNN tensor and weight is a dense tensor 🤗Datasets	3	11433	May 6, 2022
When using AutoModelForCausalLM, THUDM/cogagent-vqa-hf and load_in_8bit I get this error : self and mat2 must have the same dtype, but got Half and Char 🤗Transformers	0	222	February 4, 2024

Int8 quantization working, int4 and float8 not

Related topics