Finetune language model for feature extraction

Hello, I was wondering if it's possible to finetune a language model on a corpus with a masked language modelling (MLM) or causal language modelling (CLM) objective for the purpose of feature extraction (i.e. pulling embeddings from it afterwards).
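For concreteness, this is the kind of extraction I have in mind (a rough sketch; `bert-base-uncased` is just a placeholder for whatever checkpoint the finetuning would produce):

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Placeholder checkpoint; in practice this would point at my finetuned model.
checkpoint = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModel.from_pretrained(checkpoint)  # bare encoder, no task head
model.eval()

inputs = tokenizer("Some example text.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# last_hidden_state has shape (batch_size, seq_len, hidden_size).
token_embeddings = outputs.last_hidden_state
# Mean-pool over tokens to get one vector per input (fine here, since
# there is a single unpadded sequence in the batch).
sentence_embedding = token_embeddings.mean(dim=1)
```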

I know I can finetune on a corpus for a specific task (like MLM or CLM), but to my understanding that only changes the final layer of the model, i.e. the task head. What I'm trying to do is pull the last hidden state of the model after encoding text inputs. If only the head changes during finetuning, then pulling the last hidden state from the finetuned model would be no different from pulling it from the pretrained one. Is it possible to unfreeze just the last two layers of a language model and then finetune it?
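Something like the following is what I mean by unfreezing only the last two layers (a sketch assuming a BERT-style model; the attribute path `model.bert.encoder.layer` is specific to BERT and would differ for other architectures):

```python
from transformers import AutoModelForMaskedLM

model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")

# Freeze every parameter first.
for param in model.parameters():
    param.requires_grad = False

# Unfreeze only the last two transformer layers of the encoder.
for layer in model.bert.encoder.layer[-2:]:
    for param in layer.parameters():
        param.requires_grad = True

# Note: the MLM head (model.cls) is still frozen here; in BERT its decoder
# weight is tied to the input embeddings, so unfreezing it would also train those.

# Sanity check: count trainable parameters.
trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Trainable parameters: {trainable:,}")
```

The model could then be passed to a standard `Trainer` loop as usual, and only the unfrozen layers would receive gradient updates.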