Mixed Precision training (fp16), how to use in production?

harrystamenl · July 7, 2022, 10:39am

I’ve fine-tuned a roberta model and a deberta model both in fp16. The deberta was pre-trained in fp16.

But I want to use the model for production.

Is it possible to convert the fp16 model to onnx precision 16 and use in production?

harrystamenl · July 7, 2022, 10:40am

When I try i have to use floating point 32 even though it makes no difference. Should I be looking into bf16?

Topic		Replies	Views
Can I use fp16 model for mixed precision training? 🤗Transformers	0	296	January 16, 2024
Model pre-training precision database: fp16, fp32, bf16 🤗Transformers	4	7054	December 3, 2022
Does it ever make sense to finetune w fp32 if the base model was trained w fp16? Intermediate	1	749	July 8, 2022
Convert DeBERTa model to ONNX with mixed precision Models	0	1209	January 6, 2023
Does fp16 training compromise accuracy? Models	2	1198	May 17, 2022