Is last_hidden_state the output of Encoder block?

MahdiA · December 23, 2021, 4:49pm

When we use BertModel.forward() , is the last_hidden_state the output of Encoder in Transformers block?

nielsr · December 23, 2021, 5:47pm

Yes! It’s a tensor of shape (batch_size, seq_len, hidden_size).

Topic		Replies	Views
BERT: What is the shape of each Transformer Encoder block in the final hidden state? Intermediate	7	12854	March 16, 2022
Question about last_hidden_state of the bert model Beginners	0	331	December 7, 2023
MaskedLMOutput does not have last_hidden_state 🤗Transformers	0	1627	May 27, 2021
BertForSequenceClassification: Can I get the last hidden state? Beginners	0	741	January 9, 2023
How to add encoder's last hidden state to GPT2 as encoder-decoder attention Beginners	0	378	January 31, 2023