Domain adaptation with MLM and NSP

monilouise · May 9, 2022, 9:25pm

Hi,

I’ve coded a domain adapter for further pretraining a BERT model (Portuguese language) in a specific domain. I used the article Fine-tuning a masked language model - Hugging Face Course as a guide, but it doesn’t focus on next sentence prediction, only masked language model.

I’d like to know what is necessary to change the code to add support for next sentence prediction. I know I should use BertForPreTraining instead of AutoModelForMaskedLM, but my question is how to add the data and abels for NSP.

Thanks in advance.

ablam · June 24, 2022, 8:01pm

Hi. Same question here. Followed the tutorial and was wondering how to go about domain adaption for generation tasks, which would be CLM and not MLM. Were you able to figure things out?

rishabhstha · January 4, 2023, 6:21am

I have a similar question. Were you able to figure it out?

pantroluna · January 18, 2024, 4:08am

I have a same question.

Topic		Replies	Views
Continue pre-training Greek BERT with domain specific dataset 🤗Transformers	10	4659	January 4, 2023
Framework for Continual Pretraining 🤗Transformers	0	1260	August 16, 2023
Next sentence prediction on custom model 🤗Transformers	3	3392	May 14, 2024
How to train BERT from scratch on a new domain for both MLM and NSP? Models	2	2299	February 6, 2021
Continue pre-training of Greek BERT with domain specific dataset Beginners	7	3029	August 6, 2021

Domain adaptation with MLM and NSP

Related topics