Hi! I’ve been excited about the recent auto-gptq quantization integration and appreciate the work folks did to make it happen. I was playing around a bit and came across this line,
which seems to imply that auto-gptq only supports float16, not bfloat16. I uncommented this line and things seem to be working fine; do folks have thoughts on this? TheBloke sometimes seems to run quantization with bfloat16 ([BUG]CUDA OUT OF MEMORY · Issue #179 · PanQiWei/AutoGPTQ · GitHub), so I’m not sure why this line is here, but I could be missing something (e.g., maybe some of the CUDA kernels are float16-only)?