Can we directly use the embeddings from masked language models?

olaffson · December 15, 2021, 7:09pm

Hello there,

I have a short conceptual question. I know can train a masked language model from scratch. By doing so with huggingface, I should be able to obtain a model that is very good at … filling the [mask] token!

But what about the embeddings? are they any good for clustering for instance? Note that I am NOT fine-tuning the MLM model in any way. I am only interested in the embeddings that come from the MLM task itself.

Any suggestions or papers greatly appreciated.
Thanks!

Topic		Replies	Views
Masked Language Modelling with Hugging Face / RoBERTa - Video tutorial Beginners	2	513	April 8, 2022
[HELP] How to include emojis in masked language modelling? Beginners	0	861	June 8, 2021
Finetuning on MLM task Models	0	659	June 29, 2021
SpanBERT, ELECTRA, MARGE from scratch? Beginners	5	1379	July 22, 2023
Sequence classification VS MaskedLM Beginners	1	737	October 8, 2020

Can we directly use the embeddings from masked language models?

Related topics