Creating a tokenizer with both custom tokens and positions

Hey!
I’m trying to train a BART model with customized positional embeddings, similar to what you have been doing, and I have a few questions that you might be able to help me with. First of all, say I want to change BART’s positional embeddings to sinusoidal embeddings, just like you did @bengul: is that even possible? My intuition is that so many parts of the Transformer architecture would have to be re-learned that it might not be worth doing, or am I wrong here? My second question assumes it actually works, i.e. that it is possible to swap out the positional embeddings: what kind of computing resources would be needed to retrain the model so that it treats positions this way?
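To make the first question concrete, here is a rough sketch of the kind of swap I have in mind (not something I have verified works across transformers versions). It assumes the `facebook/bart-base` checkpoint and that `embed_positions` is called with `(input_ids, past_key_values_length)`, the way `BartLearnedPositionalEmbedding` is in recent transformers releases; the `SinusoidalPositionalEmbedding` class below is just my own helper, not a library class.

```python
# Rough sketch: replace BART's learned positional embeddings with fixed
# sinusoidal ones. The forward signature below mirrors how embed_positions
# is called in recent transformers versions; older/newer releases may differ.
import math
import torch
import torch.nn as nn
from transformers import BartForConditionalGeneration


class SinusoidalPositionalEmbedding(nn.Module):
    """Fixed sin/cos positional embeddings from 'Attention Is All You Need'."""

    def __init__(self, max_positions: int, embedding_dim: int):
        super().__init__()
        position = torch.arange(max_positions).unsqueeze(1)  # (max_positions, 1)
        div_term = torch.exp(
            torch.arange(0, embedding_dim, 2) * (-math.log(10000.0) / embedding_dim)
        )
        weight = torch.zeros(max_positions, embedding_dim)
        weight[:, 0::2] = torch.sin(position * div_term)
        weight[:, 1::2] = torch.cos(position * div_term)
        # Registered as a buffer, not a Parameter: these embeddings are never trained.
        self.register_buffer("weight", weight)

    def forward(self, input_ids: torch.Tensor, past_key_values_length: int = 0):
        # Assumed to match BartLearnedPositionalEmbedding's call convention:
        # input_ids has shape (batch, seq_len); return (batch, seq_len, dim).
        bsz, seq_len = input_ids.shape[:2]
        positions = torch.arange(
            past_key_values_length,
            past_key_values_length + seq_len,
            device=self.weight.device,
        ).expand(bsz, -1)
        return self.weight[positions]


model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")
d_model = model.config.d_model
max_pos = model.config.max_position_embeddings

# Swap the learned positional embeddings in both encoder and decoder.
model.model.encoder.embed_positions = SinusoidalPositionalEmbedding(max_pos, d_model)
model.model.decoder.embed_positions = SinusoidalPositionalEmbedding(max_pos, d_model)
```

Is something along these lines what you did, or does more of the model need to change before it can adapt to the new position signal?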

Hope you can help me with some insight into this :slight_smile: @bengul @cowszero

Thanks!