Using a fine-tuned sentence completion model in a Masked LM task

PeteSandoval · September 22, 2021, 5:49pm

Is it possible to use a BERT model that has already been fine-tuned (e.g. a SQuAD-tuned BERT) on a masked LM task? I suspect that the sentence-completion model added on top of BERT is fundamentally incompatible with a masked LM task, but I’d like to know for certain.

I’ve attempted to do this, using:

from transformers import AutoTokenizer, AutoModelForMaskedLM
tokenizer = AutoTokenizer.from_pretrained("deepset/bert-base-cased-squad2")
model = AutoModelForMaskedLM.from_pretrained("deepset/bert-base-cased-squad2")

but the results are very bad, so maybe I’m missing a step.

Thanks!

nielsr · September 23, 2021, 8:14am

Hi,

that’s not really possible, unless the model has a language modeling head that has been trained. If that’s not the case, it will load a randomly initialized language modeling head, which gives random predictions.

The "deepset/bert-base-cased-squad2" checkpoint has a fine-tuned question-answering head, but not a trained language modeling head.
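
If you want to confirm this for a given checkpoint, here is a minimal sketch. `from_pretrained` accepts `output_loading_info=True` and then also returns a dict that lists, among other things, the `missing_keys` (parameters the model needs but the checkpoint did not contain, so they were freshly initialized). The helper name `untrained_head_keys` and the `cls.` prefix are assumptions for illustration: that prefix is where BERT's masked-LM head parameters live, but the exact key names reported may differ between transformers versions and architectures.

```python
def untrained_head_keys(missing_keys, head_prefix="cls."):
    """Return the missing parameter names that belong to the LM head.

    `head_prefix` is an assumption: BERT's masked-LM head parameters are
    named "cls.predictions...."; other architectures use other prefixes.
    """
    return [key for key in missing_keys if key.startswith(head_prefix)]


def check_checkpoint(checkpoint_name):
    """Load a checkpoint as a masked LM and report which head weights
    were NOT found in it (and were therefore randomly initialized)."""
    # Imported here so the helper above can be used without transformers.
    from transformers import AutoModelForMaskedLM

    model, loading_info = AutoModelForMaskedLM.from_pretrained(
        checkpoint_name, output_loading_info=True
    )
    return untrained_head_keys(loading_info["missing_keys"])


# Usage (downloads the checkpoint, so it needs network access):
#   missing = check_checkpoint("deepset/bert-base-cased-squad2")
#   if missing:
#       print("LM head was randomly initialized:", missing)
```

A non-empty result means the language modeling head was not in the checkpoint, so the mask predictions cannot be trusted. The same information is printed as a warning ("Some weights ... were not initialized from the model checkpoint") when the model is loaded.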

PeteSandoval · September 23, 2021, 9:57pm

Thank you!
