Trainer class optimization for transformer models

I am following the “Fine-tuning a pretrained model” tutorial from the Hugging Face Transformers documentation, with the only change being that I replaced bert-base-uncased with google/mobilebert-uncased. Using the Hugging Face Trainer I get decent results, but when I train with a native PyTorch loop, the accuracy stays stuck around 50%. Is the Hugging Face Trainer doing some optimization that I need to add explicitly to the native PyTorch code for it to work? Thank you!
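For reference, my native PyTorch loop has the same shape as the tutorial's: a plain `zero_grad` / `backward` / `step` cycle with `AdamW`. Here is a minimal self-contained sketch of that loop structure; a tiny linear classifier on synthetic data stands in for google/mobilebert-uncased and my tokenized dataset, so the hyperparameters and data here are illustrative only, not my actual setup.

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

torch.manual_seed(0)

# Synthetic "features" and binary labels standing in for tokenized text.
features = torch.randn(256, 16)
labels = (features.sum(dim=1) > 0).long()
loader = DataLoader(TensorDataset(features, labels), batch_size=32, shuffle=True)

# Stand-in for the MobileBERT sequence-classification model.
model = nn.Linear(16, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=0.05)
loss_fn = nn.CrossEntropyLoss()

# Same loop structure as the tutorial's native PyTorch section.
model.train()
for epoch in range(10):
    for batch_x, batch_y in loader:
        optimizer.zero_grad()
        loss = loss_fn(model(batch_x), batch_y)
        loss.backward()
        optimizer.step()

# Check accuracy on the (synthetic) training data.
model.eval()
with torch.no_grad():
    acc = (model(features).argmax(dim=1) == labels).float().mean().item()
print(f"accuracy: {acc:.2f}")
```

On the real model I also move batches and the model to the GPU; I've left that out here to keep the sketch short.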