I'm using a GPTLMHead model in PyTorch.
Is it possible to add autocast() inside the forward function of GPTLMHead and change the training process to follow the Automatic Mixed Precision tutorial (PyTorch Tutorials 1.8.1+cu102 documentation)?
Yes, it's possible.
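Roughly like this, as a minimal sketch: wrap the body of forward in autocast() and use a GradScaler in the training step, as in the AMP tutorial. `ToyLMHead` below is a hypothetical stand-in for your GPTLMHead (the real model's forward can be wrapped the same way), and the snippet gates AMP on CUDA availability so it also runs on CPU:

```python
import torch
import torch.nn as nn

class ToyLMHead(nn.Module):
    # Hypothetical minimal stand-in for a GPTLMHead-style model.
    def __init__(self, vocab_size=100, hidden=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.head = nn.Linear(hidden, vocab_size)

    def forward(self, input_ids):
        # autocast runs eligible ops in float16 on CUDA; disabled on CPU
        # here only so the sketch is runnable anywhere.
        with torch.cuda.amp.autocast(enabled=torch.cuda.is_available()):
            return self.head(self.embed(input_ids))

model = ToyLMHead()
opt = torch.optim.AdamW(model.parameters(), lr=1e-3)
# GradScaler rescales the loss to avoid float16 gradient underflow.
scaler = torch.cuda.amp.GradScaler(enabled=torch.cuda.is_available())

input_ids = torch.randint(0, 100, (4, 16))
targets = torch.randint(0, 100, (4, 16))

logits = model(input_ids)
loss = nn.functional.cross_entropy(
    logits.view(-1, logits.size(-1)), targets.view(-1)
)
scaler.scale(loss).backward()  # backward on the scaled loss
scaler.step(opt)               # unscales grads, then opt.step()
scaler.update()
opt.zero_grad()
```

With `enabled=False` both autocast and GradScaler become no-ops, so the same loop works unchanged for a full-precision run.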
I trained the GPT model on a V100 with mixed precision training. But at inference time, the generated tokens are garbage with autocast() enabled; when I turn it off, everything is OK.
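One thing to watch out for: if autocast() is hard-coded inside forward with `enabled=True`, wrapping the call site in an outer `autocast(enabled=False)` will not disable it, because the inner context re-enables autocast. A common workaround is to gate the inner context on a flag you can flip for generation. In this sketch, `use_amp` is an assumed attribute, not part of any real GPTLMHead API:

```python
import torch
import torch.nn as nn

class ToyLMHead(nn.Module):
    # Hypothetical minimal model; `use_amp` is an assumed flag added
    # for illustration so autocast can be switched off at inference.
    def __init__(self, vocab_size=100, hidden=32, use_amp=True):
        super().__init__()
        self.use_amp = use_amp
        self.embed = nn.Embedding(vocab_size, hidden)
        self.head = nn.Linear(hidden, vocab_size)

    def forward(self, input_ids):
        # Autocast only when the flag is set AND CUDA is present,
        # so generation can run in full float32.
        enabled = self.use_amp and torch.cuda.is_available()
        with torch.cuda.amp.autocast(enabled=enabled):
            return self.head(self.embed(input_ids))

model = ToyLMHead()
model.eval()
model.use_amp = False  # turn AMP off for generation
with torch.no_grad():
    input_ids = torch.randint(0, 100, (1, 8))
    logits = model(input_ids)
    next_token = logits[:, -1, :].argmax(dim=-1)
```

Since the forward now runs in float32 at inference, the logits (and hence the sampled tokens) match the non-autocast behavior you saw working.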