MNLI inference on a fine-tuned model from the Hub

Hello,
I tried to run various models such as

  • huggingface/distilbert-base-uncased-finetuned-mnli
  • microsoft/deberta-v2-xxlarge-mnli
  • roberta-large-mnli
  • squeezebert/squeezebert-mnli

I can see that the weights are loaded, but the evaluation accuracy I get is only about 7%.
I am using "transformers_version": "4.3.2" and the following arguments:

python run_glue.py --model_name_or_path roberta-large-mnli --task_name mnli --do_eval --max_seq_length 128 --output_dir /tmp/mnli/
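
For reference, this is roughly what I understand the evaluation to be doing under the hood. A minimal sketch I put together (assuming the glue/mnli validation_matched split from the datasets library; the subsample size is just for illustration):

  from datasets import load_dataset
  from transformers import AutoModelForSequenceClassification, AutoTokenizer
  import torch

  model_name = "roberta-large-mnli"
  tokenizer = AutoTokenizer.from_pretrained(model_name)
  model = AutoModelForSequenceClassification.from_pretrained(model_name)
  model.eval()

  # matched validation split of MNLI from the GLUE benchmark
  dataset = load_dataset("glue", "mnli", split="validation_matched")

  correct = 0
  n = 100  # small subsample, just for illustration
  for example in dataset.select(range(n)):
      inputs = tokenizer(example["premise"], example["hypothesis"],
                         truncation=True, max_length=128, return_tensors="pt")
      with torch.no_grad():
          pred = model(**inputs).logits.argmax(dim=-1).item()
      correct += int(pred == example["label"])
  print(f"accuracy: {correct / n:.2%}")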

Any help would be appreciated,
Thank you

Hi Ali93H,

I don’t understand what you are trying to do. Do you want to do further fine-tuning (in which case you might want DistilBertForMaskedLM), or to classify your texts, for example for sentiment analysis (in which case you might want DistilBertForSequenceClassification)?

The transformers docs are here: Quick tour — transformers 4.3.0 documentation, Summary of the models — transformers 4.3.0 documentation, DistilBERT — transformers 4.3.0 documentation.
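
For example, the classification head would be loaded like this. A minimal sketch using the distilbert MNLI checkpoint from the original post (note that the printed label name comes from whatever the checkpoint’s config declares, which may be generic):

  import torch
  from transformers import DistilBertForSequenceClassification, DistilBertTokenizerFast

  name = "huggingface/distilbert-base-uncased-finetuned-mnli"
  tokenizer = DistilBertTokenizerFast.from_pretrained(name)
  model = DistilBertForSequenceClassification.from_pretrained(name)

  # NLI is a sentence-pair task: the premise and hypothesis are encoded together
  inputs = tokenizer("A soccer game with multiple males playing.",
                     "Some men are playing a sport.", return_tensors="pt")
  with torch.no_grad():
      logits = model(**inputs).logits
  # the index -> label mapping is whatever the checkpoint was saved with
  print(model.config.id2label[logits.argmax(dim=-1).item()])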

Thanks for your reply. I’m trying to run an evaluation on an already fine-tuned model (to reproduce its reported accuracy metrics).
If I try this with mrpc or stsb (with fine-tuned models, of course) it works just fine, but I don’t know what the issue is with MNLI.

Here is the source of the issue and a solution: [run_glue] Add MNLI compatible mode by JetRunner · Pull Request #10203 · huggingface/transformers · GitHub
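
If I understand the linked PR correctly, the root cause is that the MNLI checkpoints on the Hub were trained with a label order that differs from the one the datasets library uses for glue/mnli, so the raw predictions get scored against the wrong indices. Here is a sketch of a manual check and workaround that remaps by label name (assuming the checkpoint’s config carries meaningful label names, as roberta-large-mnli does):

  from datasets import load_dataset
  from transformers import AutoConfig

  model_name = "roberta-large-mnli"
  config = AutoConfig.from_pretrained(model_name)
  print(config.id2label)  # the label order the checkpoint was trained with

  dataset = load_dataset("glue", "mnli", split="validation_matched")
  print(dataset.features["label"].names)  # the label order the dataset uses

  # map model prediction index -> dataset label index by (lower-cased) name
  dataset_label2id = {name: i for i, name in enumerate(dataset.features["label"].names)}
  model_to_dataset = {
      model_id: dataset_label2id[label.lower()]
      for model_id, label in config.id2label.items()
  }
  print(model_to_dataset)  # apply this to argmax predictions before computing accuracy

Since 4.3.2 evidently does not include that fix, upgrading transformers or remapping the predictions as above should bring the accuracy back to the expected numbers.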
