Hi everyone,
I would like to take an already pre-trained version of the BERT model and run additional pre-training steps on a domain-specific dataset (an English learners dataset).
During this pre-training, I would like to use the Masked Language Modeling (MLM) objective, as well as another custom objective (classification on CEFR levels).
I am looking for help on how to add this second objective (the total loss would be the sum of the two losses), along with the associated architecture changes (typically, I need some extra output nodes for the classification task).
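To make it concrete, here is a rough, untested sketch of what I have in mind, using the Hugging Face transformers library. The class name `BertForMLMAndCEFR`, the `bert-base-uncased` checkpoint, and the default of 6 CEFR levels (A1–C2) are just my assumptions for illustration:

```python
import torch.nn as nn
from transformers import BertForMaskedLM


class BertForMLMAndCEFR(nn.Module):
    """Pre-trained BERT encoder + its MLM head, plus an extra CEFR classification head."""

    def __init__(self, model_name="bert-base-uncased", num_cefr_levels=6):
        super().__init__()
        # Re-use the pre-trained encoder *and* MLM head so further MLM pre-training
        # starts from the published weights.
        self.mlm_model = BertForMaskedLM.from_pretrained(model_name)
        hidden_size = self.mlm_model.config.hidden_size
        # Extra output nodes for the CEFR task (A1..C2 -> 6 classes by default).
        self.dropout = nn.Dropout(self.mlm_model.config.hidden_dropout_prob)
        self.cefr_classifier = nn.Linear(hidden_size, num_cefr_levels)

    def forward(self, input_ids, attention_mask=None, mlm_labels=None, cefr_labels=None):
        # Shared encoder forward pass.
        encoder_out = self.mlm_model.bert(input_ids=input_ids, attention_mask=attention_mask)
        sequence_output = encoder_out.last_hidden_state              # (batch, seq_len, hidden)

        # MLM logits from the original pre-trained head.
        mlm_logits = self.mlm_model.cls(sequence_output)              # (batch, seq_len, vocab)
        # CEFR logits from the [CLS] token representation.
        cefr_logits = self.cefr_classifier(self.dropout(sequence_output[:, 0]))

        loss = None
        if mlm_labels is not None and cefr_labels is not None:
            loss_fct = nn.CrossEntropyLoss()                          # positions labelled -100 are ignored
            mlm_loss = loss_fct(
                mlm_logits.view(-1, self.mlm_model.config.vocab_size), mlm_labels.view(-1)
            )
            cefr_loss = loss_fct(cefr_logits, cefr_labels)
            loss = mlm_loss + cefr_loss                                # total loss = sum of the two losses
        return {"loss": loss, "mlm_logits": mlm_logits, "cefr_logits": cefr_logits}
```

The idea would be to feed it batches that carry both the MLM labels (e.g. produced by `DataCollatorForLanguageModeling`) and a per-sequence CEFR label, then backprop the summed loss in a standard training loop. Does this look like a reasonable approach, or is there a better/standard way to do this?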
Thanks in advance,
Yann