Help with Training a Custom Model using Hugging Face Transformers

Hi everyone,

I’ve been trying to train a model with the Hugging Face Transformers library, but I’m running into some issues. I want to fine-tune a custom language model on my own dataset, but training is taking longer than expected, and the model’s accuracy is not where I’d like it to be.

My Problems:

  1. Training Time: My model has been training for 10 hours, but its accuracy is still very low. The dataset I’m using contains about 50,000 samples. How can I reduce the training time?
  2. Hyperparameter Tuning: I’m confused about which hyperparameters I should experiment with. How can I evaluate the impact of parameters like learning rate and batch size?
  3. Model Selection: I’m currently using a BERT-based model, but I wonder if another model might yield better results. What model recommendations do you have?
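To make problem 2 more concrete, here’s roughly the kind of small grid search I was planning to run over learning rate and batch size (a simplified sketch: `train_and_eval` is a placeholder standing in for my actual Trainer run, not a real Transformers function):

```python
from itertools import product

# Candidate hyperparameters to compare
# (common starting ranges for BERT-style fine-tuning).
learning_rates = [5e-5, 3e-5, 2e-5]
batch_sizes = [16, 32]

def train_and_eval(lr, batch_size):
    """Placeholder: in my real script this would run a short training
    pass with the given settings and return validation accuracy.
    Stubbed out here for illustration."""
    return 0.0  # replace with an actual training + evaluation run

# Train once per combination and record the validation score.
results = {}
for lr, bs in product(learning_rates, batch_sizes):
    results[(lr, bs)] = train_and_eval(lr, bs)

# Pick the best-scoring configuration.
best = max(results, key=results.get)
print(f"Best config: lr={best[0]}, batch_size={best[1]}")
```

Is this a reasonable way to compare settings, or should I be using a proper search tool instead of a manual loop like this?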

Thanks in advance for your help! If you need more information, feel free to ask.