How can I train an MLM without labels?

I would like to train a masked language model to generate text using reinforcement learning. Since the generated text should not match the original masked-out text, but should instead be optimized against a reward function, what should the labels for the model be?
When training a T5 model, I would normally need to supply either decoder inputs or labels, but I do not want to use teacher forcing; instead, I want to feed the generated tokens back in autoregressively. I am unclear about how to set up this sort of training.
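
For concreteness, here is a rough sketch of the kind of training step I have in mind: sample a completion autoregressively, then re-run the model with the sampled tokens as labels to get differentiable log-probabilities, and weight them by the reward (REINFORCE-style). `reward_fn` is just a placeholder for my actual reward function, and I am not sure this is the right approach:

```python
import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

# Placeholder reward -- stands in for my actual reward function.
def reward_fn(texts):
    return torch.tensor([float(len(t)) / 100.0 for t in texts])

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)

prompts = ["The <extra_id_0> sat on the mat."]
inputs = tokenizer(prompts, return_tensors="pt", padding=True)

# 1. Sample a completion autoregressively (no teacher forcing).
with torch.no_grad():
    generated = model.generate(**inputs, do_sample=True, max_new_tokens=20)

# generate() prepends the decoder start token (the pad token for T5),
# so strip it before using the sampled sequence as labels.
labels = generated[:, 1:]

# 2. Re-run the model with the sampled tokens as labels, which gives
#    differentiable log-probabilities of exactly those tokens.
outputs = model(**inputs, labels=labels)
log_probs = torch.log_softmax(outputs.logits, dim=-1)
token_log_probs = log_probs.gather(-1, labels.unsqueeze(-1)).squeeze(-1)

# Mask out padding so it does not contribute to the loss.
mask = (labels != tokenizer.pad_token_id).float()
seq_log_prob = (token_log_probs * mask).sum(dim=-1)

# 3. REINFORCE: weight the sequence log-probability by the reward.
texts = tokenizer.batch_decode(generated, skip_special_tokens=True)
rewards = reward_fn(texts)
loss = -(rewards * seq_log_prob).mean()

loss.backward()
optimizer.step()
optimizer.zero_grad()
```

Is this generate-then-rescore pattern the right way to avoid teacher forcing, or is there a more standard way to train a masked language model with a reward signal instead of labels?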