I’m using bitsandbytes int8 and PEFT with transformers. Is there any advantage to loading the model in bf16 together with int8, even though the weights get cast to fp16 during quantization? Or should this combination be strictly avoided?
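For reference, here is a minimal sketch of the kind of setup I mean (not my exact code; the checkpoint name and LoRA hyperparameters are just placeholders): the model is loaded with bitsandbytes int8 quantization while `torch_dtype=torch.bfloat16` is passed for the non-quantized modules, and a PEFT LoRA adapter is attached on top.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load in int8 via bitsandbytes, but request bf16 for the remaining fp modules.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder checkpoint
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    torch_dtype=torch.bfloat16,  # the dtype choice in question
    device_map="auto",
)

# Prepare the quantized model for training and attach a LoRA adapter with PEFT.
model = prepare_model_for_kbit_training(model)
model = get_peft_model(
    model,
    LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05, task_type="CAUSAL_LM"),
)
model.print_trainable_parameters()
```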