Adding a New tokens to ViT

tankwell · March 10, 2023, 2:53pm

Hey,

I want to add 2 custom embeddings (tokens) to a pretrained transformers
Those tokens will represent some argument to the image
I have seen there is a similar method to BERT and text transformers (add a word to the vocabulary) but I did not find something for image transformers

Currently I do it with ugly code that overrides the embeddings of the built-in Vit and adds the token.
If you have a smarter solutions / similar ideas I would be happy to hear

Thanks

Topic		Replies	Views
Vision Transformer embeddings interpolation 🤗Transformers	0	366	July 6, 2022
Soft prompt learning for BERT and GPT using Transformers 🤗Transformers	4	3813	July 31, 2023
How can I make a Img2Text transformer using the existent modules? Intermediate	1	821	October 21, 2021
Using trasnsformer to get image features 🤗Transformers	3	3346	March 20, 2024
Using Huggingface for computer vision (Tensorflow)? 🤗Transformers	3	413	June 2, 2025

Adding a New tokens to ViT

Related topics