I am trying to extract an embedding from a decoder-only LLM. I tried using hidden states: I append an EOS token to the input and pass it through the model. But embeddings taken from the EOS token's hidden state in the last hidden layer, or from the concatenation of all tokens' hidden states in the last hidden layer, don't perform well when comparing different prompts with cosine similarity.
Is there a way to extract an embedding from a decoder-only model so that different prompts can be compared?
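For reference, this is roughly what I'm doing (a minimal sketch; gpt2 is just a stand-in model, and the EOS-state pooling is exactly the part I'm unsure about):

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2").eval()

def embed(prompt: str) -> torch.Tensor:
    # Append the EOS token so its hidden state can act as a summary vector.
    inputs = tokenizer(prompt + tokenizer.eos_token, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs, output_hidden_states=True)
    last_hidden = outputs.hidden_states[-1]  # (1, seq_len, hidden_size)
    return last_hidden[0, -1]                # hidden state at the EOS position

a = embed("How do I sort a list in Python?")
b = embed("What is the capital of France?")
print(torch.nn.functional.cosine_similarity(a, b, dim=0).item())
```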
Can you please give a specific example: what is the output and what do you expect?
Hi @mahmutc,
Thanks for your interest!
I am trying to generate a vector representation of a prompt using a decoder-only model, so the input would be the prompt/sentence and the output would be a vector representing it. These vectors could then be used to compare one prompt/sentence with another; see the sketch below. This would be valuable work, since decoder-only models are evolving rapidly.
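For instance, the comparison could look like this (a hedged sketch assuming mask-aware mean pooling over the last hidden layer; the model name and pooling choice are illustrative, not a settled method):

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default
model = AutoModel.from_pretrained("gpt2").eval()

def embed(prompts):
    inputs = tokenizer(prompts, return_tensors="pt", padding=True)
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state   # (batch, seq, hidden)
    mask = inputs["attention_mask"].unsqueeze(-1)    # zero out padding positions
    return (hidden * mask).sum(dim=1) / mask.sum(dim=1)  # mask-aware mean pooling

vecs = embed(["first prompt", "a second, different prompt"])
print(torch.nn.functional.cosine_similarity(vecs[0], vecs[1], dim=0).item())
```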
Hi @MattiLinnanvuori,
Thanks for the results. I tried both of them: the first method (weighted average pooling) has a length issue, since the resulting vector depends on the length of the prompt, and the second method also didn't perform well. I think the paper has a proper explanation; I'm working through it.
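For anyone following along, the weighted average pooling I tried looks roughly like this (my reading of the method, with linearly increasing position weights; it may differ from the paper's exact formulation):

```python
import torch

def weighted_mean_pool(hidden: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    # hidden: (batch, seq, dim); attention_mask: (batch, seq)
    weights = torch.arange(1, hidden.size(1) + 1, dtype=hidden.dtype)  # 1..seq_len
    weights = weights.unsqueeze(0) * attention_mask                    # zero out padding
    weights = weights / weights.sum(dim=1, keepdim=True)               # normalize per row
    return (hidden * weights.unsqueeze(-1)).sum(dim=1)                 # (batch, dim)
```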
Thanks!
Great help.