Data2vec: average target vector computation not shown in code

chandresh · May 17, 2022, 10:47am

Hi All,

I was going through the data2vec code for text. In the paper, the authors say the targets are computed by averaging the outputs of the top K encoder blocks. However, I can't find the part of the code that does this. Can you point me to the lines that perform that computation? Also, the paper mentions L1 and MSE loss functions, but I don't see them in the code either.
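
To make clear what I am looking for, here is a rough sketch of the computation as I understand it from the paper. It is not taken from the data2vec code: the function names `average_top_k` and `smooth_l1`, the `beta` parameter, and the toy numbers are my own assumptions, and I use plain Python lists instead of tensors to keep it self-contained.

```python
# Hypothetical sketch, not the actual data2vec implementation.

def average_top_k(layer_outputs, k):
    """Element-wise average of the last k layer outputs.

    layer_outputs: list of vectors (lists of floats), one per encoder
    block, ordered from the bottom block to the top block. k must be >= 1.
    """
    top = layer_outputs[-k:]
    return [sum(values) / len(top) for values in zip(*top)]


def smooth_l1(pred, target, beta=1.0):
    """Smooth L1 loss: quadratic for small errors, linear (L1-like) for large ones."""
    total = 0.0
    for p, t in zip(pred, target):
        d = abs(p - t)
        if d < beta:
            total += 0.5 * d * d / beta   # squared-error regime
        else:
            total += d - 0.5 * beta       # L1 regime
    return total / len(pred)


# Toy example: three "blocks", each producing a 2-dimensional vector.
layers = [[0.0, 0.0], [1.0, 2.0], [3.0, 4.0]]
target = average_top_k(layers, k=2)      # average of the top 2 blocks
loss = smooth_l1([2.5, 5.0], target)     # prediction vs. averaged target
print(target, loss)
```

I would expect something equivalent to these two steps somewhere in the model's forward pass or in the training loss, which is what I can't locate.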
