Metrics for masked language modeling (MLM)

I want to compare the fit of my BERT model before and after running MLM training for a few epochs on my own textual data.

According to "Perplexity of fixed-length models" in the transformers 4.10.1 documentation, perplexity isn't well defined for MLM. Which metric should I use instead? So far I have calculated accuracy on the masked tokens only (comparing the actual labels at the masked positions with the token the model predicts for each of those positions). This is obviously not a great metric, since it punishes a synonym just as harshly as any other wrong prediction.
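
For reference, here is a minimal sketch of how I compute that masked-token accuracy (the model name and example sentence are just placeholders; I use `DataCollatorForLanguageModeling` to apply the 15% masking):

```python
import torch
from transformers import (
    AutoModelForMaskedLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
)

# Placeholder checkpoint; in practice this would be the model
# before/after my own MLM training.
model_name = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)
model.eval()

# The collator masks ~15% of tokens and sets labels to -100
# at all positions that were not selected for masking.
collator = DataCollatorForLanguageModeling(tokenizer, mlm_probability=0.15)

texts = ["The quick brown fox jumps over the lazy dog."]  # placeholder data
encodings = tokenizer(texts, truncation=True, return_tensors="pt")
batch = collator([{k: v.squeeze(0) for k, v in encodings.items()}])

with torch.no_grad():
    outputs = model(
        input_ids=batch["input_ids"],
        attention_mask=batch["attention_mask"],
    )

preds = outputs.logits.argmax(dim=-1)
labels = batch["labels"]
mask = labels != -100  # only masked positions count toward the metric

accuracy = (preds[mask] == labels[mask]).float().mean().item()
print(f"masked-token accuracy: {accuracy:.4f}")
```

As noted above, a synonym predicted at a masked position scores exactly zero here, the same as a completely unrelated token, which is why I'm looking for a better alternative.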