Embeddings from the Decoder only model

hi @manojkumar427

But embeddings taken by the hidden state values of EOS in last_hidden_layer or concatenation of all tokens hidden state values from last_hidden_layer aren’t performing well using cosine similarity of different prompts.

Can you please give a specific example: what is the output and what do you expect?

1 Like