What is the recommended way to do inference with low precision during training?

It seems that converting the model's dtype changes the training loss.
For instance, the training losses of a) and b) are inconsistent:
a):

train(model)
model.half()   # casts the weights to fp16 in place
eval(model)
model.float()  # casts back to fp32; the fp16 round trip is lossy

b):

train(model)
eval(model)

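For what it's worth, here is a minimal sketch (assuming a toy nn.Linear model; this is my guess at what is going on) of why a) diverges: the fp32 -> fp16 -> fp32 round trip is lossy, so training in a) resumes from slightly perturbed weights.

import torch
import torch.nn as nn

model = nn.Linear(10, 10)
original = model.weight.detach().clone()
model.half()   # fp32 -> fp16 truncates mantissa bits in place
model.float()  # fp16 -> fp32 cannot restore the lost bits
print(torch.equal(original, model.weight))    # False: the weights drifted
print((original - model.weight).abs().max())  # small but nonzero difference
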
I have to use deepcopy to solve the issue:

import copy

train(model)
model_copy = copy.deepcopy(model).half()
eval(model_copy)
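
A quick check, under the same toy-model assumption as above, that the copy-based approach leaves the training weights untouched:

import copy
import torch
import torch.nn as nn

model = nn.Linear(10, 10)
before = model.weight.detach().clone()
model_copy = copy.deepcopy(model).half()  # only the copy is cast to fp16
print(torch.equal(before, model.weight))  # True: the fp32 original is unchanged
print(model_copy.weight.dtype)            # torch.float16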

Is there a better way to evaluate the model in fp16 during training without hard-copying the model?

Hello @hu22nlp,

Are you using mixed precision? If so, inference already happens with fp16/bf16 by default and no changes are required; only the final loss is converted to float32 for stability.
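
For reference, a minimal sketch of such a mixed-precision evaluation loop with torch.autocast (the evaluate function, eval_loader, and the cross-entropy loss here are illustrative assumptions, not from your code):

import torch
import torch.nn.functional as F

@torch.no_grad()
def evaluate(model, eval_loader, device="cuda"):
    model.eval()
    total_loss = 0.0
    for inputs, targets in eval_loader:
        inputs, targets = inputs.to(device), targets.to(device)
        # ops under autocast run in fp16; the fp32 weights are never modified
        with torch.autocast(device_type="cuda", dtype=torch.float16):
            loss = F.cross_entropy(model(inputs), targets)
        total_loss += loss.float().item()  # accumulate the loss in float32
    model.train()
    return total_loss / len(eval_loader)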