Hi @Saaira,
Just found the why and how for this question.
Why:
The pipeline generally returns the first available tensor in the model output, which for the Llama model is the logits.
Ref:
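A quick way to see this (just a sketch, assuming `model` is the already-loaded `LlamaForCausalLM` and `tokenizer` its tokenizer): inspect the model output object and check which tensor comes first.

```python
# Sketch: inspect the model output to see which tensor the pipeline would grab first.
# Assumes `model` (LlamaForCausalLM) and `tokenizer` are already loaded.
inputs = tokenizer("a short test sentence", return_tensors="pt")
out = model(**inputs, return_dict=True)

print(list(out.keys()))      # e.g. ['logits', 'past_key_values'] -- no hidden states by default
print(out[0] is out.logits)  # True: the first available tensor is the logits
```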
How:
Instead of using the pipeline (which is nice for efficiency and tidy code), call the model directly with `output_hidden_states=True`:

```python
embeddings = model(torch.IntTensor([tokenizer(sentences)['input_ids'][0]]), return_dict=True, output_hidden_states=True)
```

Then `embeddings['hidden_states']` gives you the hidden states from all the layers (including the embedding layer) for each token.
For the first sentence you will get:

```python
>>> len(embeddings['hidden_states']), embeddings['hidden_states'][0].shape
(33, torch.Size([1, 4, 4096]))
```

That is 33 layers (the embedding layer plus 32 decoder layers), for a batch of 1 sentence with 4 tokens and a hidden size of 4096.
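In case it helps, here is a fuller sketch of the same idea. The checkpoint name, the example sentence, and the mean-pooling step are my own assumptions, not part of the original answer:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical checkpoint -- use whichever Llama checkpoint you actually loaded.
model_name = "meta-llama/Llama-2-7b-hf"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

sentences = ["The quick brown fox jumps."]

with torch.no_grad():
    inputs = tokenizer(sentences, return_tensors="pt")
    embeddings = model(**inputs, return_dict=True, output_hidden_states=True)

hidden_states = embeddings["hidden_states"]   # tuple: embedding layer + one entry per decoder layer
last_layer = hidden_states[-1]                # shape: (batch, seq_len, hidden_size)
sentence_embedding = last_layer.mean(dim=1)   # simple mean pooling over tokens (one option among many)

print(len(hidden_states), last_layer.shape, sentence_embedding.shape)
```

`hidden_states[-1]` is the last decoder layer; which layer (or combination of layers) and which pooling work best for sentence embeddings depends on your task.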