Unsup fine tuning embeddings

yugen2 · June 21, 2023, 8:50am

Hello,

I am trying to fine tune model embeddings without supervision. I am referring to the output layer of the attention model, the one which is used for generating predictions. In this tutorial, they show how to generate labels from unsup text using masking, but they do it using a masked LM model, which doesn’t give access to its attention output layer, only word level logits, which is one abstraction level higher. In this tutorial, on the other hand, they do fine tune the output layer embeddings but they do so with labeled data.

I know I can implement it myself by simply generating labels by masking. But I would like to know if there is already a package that does this. And more importantly, how is this not addressed in either of these very well thought-out tutorials? It seems to me like a no brainer.

Thanks!

Topic		Replies	Views
How to train your own corpus without labels 🤗Transformers	2	3965	May 25, 2021
How to train new token embedding to add to a pretrain model? 🤗Transformers	1	3662	January 6, 2021
Finetune language model for feature extraction 🤗Transformers	0	404	July 1, 2021
How to do unsupervised fine-tuning? 🤗Transformers	1	7026	January 29, 2021
How to fine-tune a pre-trained model and then get the embeddings? Beginners	2	3797	December 20, 2022

Unsup fine tuning embeddings

Related topics