In HF’s Colab notebook for QLoRA, they set `fp16=True` in the training arguments even though the quantization config uses bf16 as the compute dtype (roughly the setup sketched below).
I have two questions here:
- What is the purpose of the `fp16` flag in the training arguments? I believe this flag enables mixed-precision training, but shouldn’t that be irrelevant when training with QLoRA?
- Shouldn’t the `fp16` flag be `False` and the `bf16` flag be `True`, to match the compute dtype?
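For reference, this is roughly the configuration I mean (paraphrased from memory, not copied verbatim from the notebook; the model name and output directory are placeholders):

```python
import torch
from transformers import BitsAndBytesConfig, TrainingArguments

# Quantization config: 4-bit NF4 weights, with compute done in bfloat16.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Training arguments: fp16 mixed precision is enabled here,
# which is what seems to conflict with the bf16 compute dtype above.
training_args = TrainingArguments(
    output_dir="outputs",   # placeholder
    fp16=True,              # <- the flag I'm asking about
    # bf16=True,            # <- shouldn't it be this instead?
)
```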