In vit_mae (src/transformers/models/vit_mae), when passing through ViTMAESdpaAttention or ViTMAESelfAttention, the position embeddings (PEs) are added to the hidden states before they are projected by the Q-, K-, and V-layers. But many works argue that PEs should only be added to Q and K. Is that a bug?
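
To make the question concrete, here is a minimal sketch of the two variants I mean. The module names (`q_proj`, `k_proj`, `v_proj`) and shapes are just illustrative, not the actual ViTMAE code:

```python
import torch
import torch.nn as nn

hidden_size, seq_len, batch = 64, 16, 2
q_proj = nn.Linear(hidden_size, hidden_size)
k_proj = nn.Linear(hidden_size, hidden_size)
v_proj = nn.Linear(hidden_size, hidden_size)

x = torch.randn(batch, seq_len, hidden_size)    # patch embeddings
pos = torch.randn(1, seq_len, hidden_size)      # absolute position embeddings

# Variant A (what I see in vit_mae): PEs are added to the hidden states,
# so they flow into Q, K, and V alike.
h = x + pos
q_a, k_a, v_a = q_proj(h), k_proj(h), v_proj(h)

# Variant B (what some papers propose): PEs only influence Q and K,
# while V stays content-only.
q_b, k_b = q_proj(x + pos), k_proj(x + pos)
v_b = v_proj(x)
```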
Thank you!