Auto vs. DistilBert for classification: accuracy/F1 varies a lot

I am trying to fine-tune “distilbert-base-uncased” on the “emotions” dataset (6 labels).
I am using DistilBertTokenizer.
When I use DistilBertForSequenceClassification.from_pretrained() I get an accuracy/F1 of only 35%,
but when I use AutoModelForSequenceClassification.from_pretrained() I get an accuracy/F1 over 90%.
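For reference, this is roughly how I am loading the two variants (a sketch, assuming the standard transformers API; everything else in my training setup is identical, and `num_labels=6` matches the dataset's label count):

```python
from transformers import (
    AutoModelForSequenceClassification,
    DistilBertForSequenceClassification,
)

checkpoint = "distilbert-base-uncased"

# Variant 1: the architecture-specific class (~35% accuracy/F1 in my runs)
model_specific = DistilBertForSequenceClassification.from_pretrained(
    checkpoint, num_labels=6
)

# Variant 2: the Auto class (>90% accuracy/F1 in my runs)
model_auto = AutoModelForSequenceClassification.from_pretrained(
    checkpoint, num_labels=6
)

# As I understand it, Auto* resolves the concrete architecture from the
# checkpoint's config, so both calls should build the same model class:
print(type(model_auto) is type(model_specific))
```

If that last line prints True, the two calls construct the same class, which makes the gap even more puzzling to me.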

I thought it was better to use the model-specific class than the Auto one, but it seems Auto works much better. What could be a logical explanation for this?