Accuracy of MLM model

Anne · June 2, 2021, 2:38am

How to calculate the accuracy of the testing dataset when we build MLM model using scrach?

sanjaysingh23 · July 7, 2021, 8:16pm

have you got the answers?

Anne · July 9, 2021, 1:43am

Hi
Yes, I found a way but not sure whether it’s the best way.
What I’m doing is for each sentence I’m masking a random word in the sentence and ask the model to predict it. If the actual word is in the predicted list I’m increasing the tp(positive) value or else the tn(negative) value. finally I’m calculating the accuracy (tp/tp+tn).

and I’m calculating the perplexity value using the following code

Hope my answer is clear to you…

sanjaysingh23 · July 9, 2021, 9:25am

Hey, thank you for your response,
i am also calculating perplexity using the same code and i am using perplexity as a metric rather than accuracy.
This is because i don’t thing accuracy would be a better choice(as a metric) for mlm because there can be may words which can be used for a given mask. (feel free the correct) , instead i am calculating perplexity in the compute metric function and printing it along with training_loss and validation loss along in the logs.

Thanks again!!

sanjaysingh23 · July 12, 2021, 4:05pm

Also how much minimum data is required to fine tune roberta any idea?

Anne · July 13, 2021, 1:07am

sorry. I don’t have an idea about it…
do you know a way to build a n-gram model for word prediction and calculate the perplexity value of it?

Topic		Replies	Views
Accuracy of Masked LM training Beginners	0	1028	June 15, 2022
Metrics for masked language modeling (mlm) Beginners	0	499	September 16, 2021
How to correctly evaluate a Masked Language Model? 🤗Transformers	3	4382	August 11, 2023
Fine-tuned MLM based RoBERTa not improving performance Research	2	947	April 20, 2023
How can I check mlm accuracy during training RoBERTa? Beginners	7	2677	August 30, 2021

Accuracy of MLM model

Related topics