The output shape of last_hidden_state is [8, 197, 768]. As far as I know, the first dimension is the batch size, the second is the sequence length, and the last is the hidden size, but I am unclear about what the sequence length and hidden size actually represent here. The code is given below:
import torch

with torch.no_grad():
    outputs = model(batch['pixel_values'])
print(outputs.last_hidden_state.shape)
# torch.Size([8, 197, 768])
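My guess, assuming the model is a ViT-base checkpoint that splits a 224x224 image into 16x16 patches (I have not confirmed this for my model): 197 would be the 196 image patches plus one prepended [CLS] token, and 768 the embedding width per token. A quick sanity check of that arithmetic:

```python
# Assumptions (not confirmed for my checkpoint): ViT-base style model,
# 224x224 input images, 16x16 patches, 768-dim token embeddings.
image_size = 224
patch_size = 16

num_patches = (image_size // patch_size) ** 2  # 14 * 14 = 196 patch tokens
seq_len = num_patches + 1                      # +1 for the [CLS] token
print(seq_len)        # 197 -- matches the second dimension of the output

hidden_size = 768     # per-token embedding width, the third dimension
```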
Can anyone explain what the sequence length and hidden size actually mean here? Thanks in advance.