I am trying to fine-tune DeBERTa for an irony detection task; the Colab notebook link can be found here.
When I try to use the ‘microsoft/deberta-v3-base’ checkpoint with AutoModel, I get the following error:
RuntimeError: Expected target size [32, 2], got [32]
But when I run the same code with ‘bert-base-uncased’ or a RoBERTa checkpoint (with some changes to the head), it works fine. You can find the working BERT-based code in this notebook.
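For context, the setup is roughly like the sketch below (the variable names and the linear head are illustrative, not copied verbatim from the notebook; the exact code is in the Colab link above):

```python
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

checkpoint = "microsoft/deberta-v3-base"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
backbone = AutoModel.from_pretrained(checkpoint)

# Simple 2-class head (ironic / not ironic) on top of the backbone output
classifier = nn.Linear(backbone.config.hidden_size, 2)

batch = tokenizer(
    ["just what I needed today", "lovely weather we are having"],
    padding=True, truncation=True, return_tensors="pt",
)
outputs = backbone(**batch)
# last_hidden_state is [batch, seq_len, hidden], so the logits come out
# as [batch, seq_len, 2] instead of [batch, 2]
logits = classifier(outputs.last_hidden_state)
```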
When I printed the shapes of the predictions and labels, I got torch.Size([32, 30, 2]) and torch.Size([32]) respectively. In the BERT case, the predictions and labels had shapes torch.Size([32, 2]) and torch.Size([32]).
Here 32 is the batch size, and 30 is the sequence length.
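The mismatch can be reproduced in isolation with random tensors of those shapes, assuming the loss is nn.CrossEntropyLoss (which is what the error message pattern suggests):

```python
import torch
import torch.nn as nn

loss_fn = nn.CrossEntropyLoss()

# Shapes observed with DeBERTa: per-token logits vs. one label per example
predictions = torch.randn(32, 30, 2)   # [batch, seq_len, num_classes]
labels = torch.randint(0, 2, (32,))    # [batch]

loss = loss_fn(predictions, labels)
# RuntimeError: Expected target size [32, 2], got [32]
```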
Can someone let me know what I’m doing wrong?