Greetings,
I am using a RobertaForSequenceClassification model for natural language inference (ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli). How can I obtain the encoding of my sequence before it passes through the final classification layer?
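For reference, this is roughly how I set everything up (the premise/hypothesis pair is just a placeholder):

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

checkpoint = "ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint)
model.eval()  # turn off dropout

# placeholder premise/hypothesis pair
encoded = tokenizer("A man is playing a guitar.", "A man is making music.",
                    return_tensors="pt")
input_ids = encoded["input_ids"]
attention_mask = encoded["attention_mask"]
token_type_ids = encoded.get("token_type_ids")  # may be None for RoBERTa
```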
Here is what I have tried. First, I listed all of the model's parameter names (the snippet after the list shows how I printed them):
```python
[...,
 'roberta.encoder.layer.23.output.LayerNorm.weight',
 'roberta.encoder.layer.23.output.LayerNorm.bias',
 'classifier.dense.weight',
 'classifier.dense.bias',
 'classifier.out_proj.weight',
 'classifier.out_proj.bias']
```
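That list came from iterating over named_parameters(), roughly like this:

```python
param_names = [name for name, _ in model.named_parameters()]
print(param_names[-6:])  # the last six entries, shown above
```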
Then I set the output_hidden_states flag to obtain the hidden-state vectors for a single input (batch_size = 1):
```python
outputs = model(input_ids, attention_mask=attention_mask,
                token_type_ids=token_type_ids,
                output_hidden_states=True, labels=None)
```
outputs[1] is the hidden states: a tuple of tensors, each of shape [1, 23, 1024] (batch size, sequence length, hidden dim), where 1024 is my hidden-state dimension.
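To make sure I am reading outputs[1] correctly, I also printed its structure; for roberta-large I believe it should hold the embedding output plus one tensor per layer:

```python
hidden_states = outputs[1]
print(len(hidden_states))       # 25 = embeddings + 24 layers for roberta-large
print(hidden_states[-1].shape)  # torch.Size([1, 23, 1024]) for my input
```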
```python
matrix = None
bias = None
dense_m = None
dense_b = None
for name, param in model.named_parameters():
    if name == 'classifier.dense.weight':
        dense_m = param
    if name == 'classifier.dense.bias':
        dense_b = param
    if name == 'classifier.out_proj.weight':
        matrix = param
    if name == 'classifier.out_proj.bias':
        bias = param

# last layer -> last (only) batch element -> last token
test = outputs[1][-1][-1][-1]
# apply classifier.dense by hand, then classifier.out_proj
t = torch.mm(dense_m, torch.unsqueeze(test, dim=1)).T + dense_b
print(torch.mm(matrix, t).T + bias)
# the model's own logits, for comparison
print(outputs[0])
```
The model's final output has shape [1, 3], but mine is [23, 3], and the last row is not equal to the original output. What have I done wrong, or what should I do in this case?
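For reference, my reading of RobertaClassificationHead.forward in the transformers source is that it takes the first (<s>) token of the last layer, applies classifier.dense followed by a tanh, and only then classifier.out_proj (dropout is the identity under model.eval()), i.e. something like:

```python
cls_vec = outputs[1][-1][:, 0, :]              # <s> token of last layer, [1, 1024]
h = torch.tanh(cls_vec @ dense_m.T + dense_b)  # classifier.dense + tanh
logits = h @ matrix.T + bias                   # classifier.out_proj
print(logits)                                  # should this match outputs[0]?
```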
Thank you very much!