QLoRA with GPTQ

lewisbails · October 10, 2023, 11:43am

UserWarning: You passed a tokenizer with padding_sidenot equal to rightto the SFTTrainer. This might lead to some unexpected behaviour due to overflow issues when training a model in half-precision. You might consider adding tokenizer.padding_side = ‘right’ to your code.

It is working now after adding padding_side=right to the tokenizer. Why does the padding side affect overflow in half-precision training?

Topic		Replies	Views
LoRA finetuning without quantization (8bit) 🤗Transformers	1	986	February 23, 2024
Fine tuning for Llama2 based model with LoftQ quantization 🤗Transformers	7	2383	January 24, 2024
Fine tuning using LOFTQ - CUDA out of memory error 🤗Transformers	4	382	February 18, 2024
"You cannot perform fine-tuning on purely quantized models." error in LoRA model training? 🤗Transformers	3	2745	August 16, 2024
CUDA Out of Memory Error SFTTrainer 🤗Transformers	1	175	February 16, 2025

QLoRA with GPTQ

Related topics