Hello,
I am trying to implement a paper with the following approach:
The idea is to fine-tune a BERT model on a language-modeling task (next-token prediction), using a triangular (causal) attention mask to enforce left-to-right language modeling.
I would like to know if it's possible to fine-tune a BERT model with a triangular mask, starting from a BERT pre-trained with a full (bidirectional) attention mask. If so, how do I do it in the implementation?
Is there a simple way to do it, or do I need to modify the source code?
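For context, here is a minimal sketch of what I have in mind, assuming the Hugging Face `transformers` API, where setting `is_decoder=True` makes BERT's self-attention apply the triangular (causal) mask. I use a tiny randomly-initialized config just to keep the snippet self-contained; with real weights one would presumably do `BertLMHeadModel.from_pretrained("bert-base-uncased", is_decoder=True)` instead:

```python
# Sketch: turn a BERT encoder into a left-to-right LM by enabling
# the causal (triangular) attention mask via is_decoder=True.
import torch
from transformers import BertConfig, BertLMHeadModel

# Tiny random config so the example runs without downloading weights.
config = BertConfig(
    vocab_size=100,
    hidden_size=32,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=64,
    is_decoder=True,  # every self-attention layer now applies a triangular mask
)
model = BertLMHeadModel(config)

input_ids = torch.randint(0, config.vocab_size, (1, 8))
# Passing labels=input_ids makes the model compute the shifted
# next-token-prediction (causal LM) loss.
outputs = model(input_ids=input_ids, labels=input_ids)
print(outputs.logits.shape)  # → torch.Size([1, 8, 100])
```

Is this the intended way to do it, or is the triangular masking supposed to be done by hand on `attention_mask`?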
Thank you so much for your help!