Pre-trained DeBERTa

Mintbach · June 14, 2023, 10:30am

Hi,

I wanted to use DeBERTa, but its masked-token predictions in the model preview seem very poor.

I looked at the source code and cannot see where the absolute positions are added.
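
To make clear what I was looking for: in BERT-style models, the absolute position embedding is added to the token embedding at the input. Below is a minimal toy sketch of that step in plain Python. It is not DeBERTa's actual code; the `embed` function, the `add_absolute_positions` flag, and all sizes are made up for illustration.

```python
import random


def embed(token_ids, tok_table, pos_table, add_absolute_positions=True):
    """Return one vector per token, optionally adding absolute position vectors."""
    out = []
    for pos, tok in enumerate(token_ids):
        vec = list(tok_table[tok])
        if add_absolute_positions:
            # BERT-style input: token embedding + embedding of the absolute index
            vec = [t + p for t, p in zip(vec, pos_table[pos])]
        out.append(vec)
    return out


random.seed(0)
VOCAB, MAX_LEN, DIM = 10, 8, 4  # hypothetical toy sizes
tok_table = [[random.uniform(-1, 1) for _ in range(DIM)] for _ in range(VOCAB)]
pos_table = [[random.uniform(-1, 1) for _ in range(DIM)] for _ in range(MAX_LEN)]

ids = [3, 3, 3]  # the same token at three different positions
with_pos = embed(ids, tok_table, pos_table, add_absolute_positions=True)
without_pos = embed(ids, tok_table, pos_table, add_absolute_positions=False)

# Without the addition, identical tokens get identical input vectors.
print(without_pos[0] == without_pos[1])
# With the addition, the position makes the vectors differ.
print(with_pos[0] == with_pos[1])
```

This addition (or an equivalent step elsewhere in the model) is what I could not find in the DeBERTa source.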

Can someone explain why the model performs so badly in the MLM preview?
Maybe I overlooked the addition of the absolute positions in the source code.
An explanation of the implementation would be really helpful as well!

Thank you!
Stephan
