I have a custom text dataset that I want BERT to get acquainted with. My final goal is not to run any supervised task (it is actually to serve as a starting point for getting sentence embeddings from S-BERT). I just want to continue the unsupervised pre-training on my dataset. How do I do this?
So far, I have come across two possible candidates in the documentation for this:
- BertForPreTraining (the self-explanatory name led me to this)
- BertForMaskedLM (as used in this blog post)
Can both of them be used for this purpose? Is one better suited to my goal? Has anyone here tried something like this before? Any additional suggestions would also be very helpful.
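For context, here is a rough sketch of what I was planning to try with BertForMaskedLM and the Trainer API (the corpus path, output directory, and hyperparameters are placeholders I made up, not tested values):

```python
from datasets import load_dataset
from transformers import (
    BertForMaskedLM,
    BertTokenizerFast,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")

# Plain-text file with one sentence/document per line (placeholder path).
dataset = load_dataset("text", data_files={"train": "my_corpus.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128)

tokenized = dataset["train"].map(tokenize, batched=True, remove_columns=["text"])

# Dynamically masks 15% of tokens per batch (the standard MLM objective).
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

args = TrainingArguments(
    output_dir="bert-domain-adapted",   # placeholder
    per_device_train_batch_size=16,     # guess; depends on GPU memory
    num_train_epochs=3,
    save_steps=1000,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    data_collator=collator,
)
trainer.train()
trainer.save_model("bert-domain-adapted")
```

Does this look like a reasonable setup, or would BertForPreTraining (with the additional next-sentence-prediction head) be the better choice here?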
Thank you