How do I add new separate layers to a pretained model to add a modality?

nolestock · July 9, 2023, 7:42am

I want to use pytorch and the hugging face Library to load a pre-trained model and freeze the weights. Then I want to add new Transformer blocks b that take in the hidden States from each of the pre-trained Transformer layers at layer B and the previous output from the added blocks at b -1.

Conceptually it’s sort of like Lora but instead of just adapting the pre-trained LLM I would like to add a modal input to these “side layers”.

Is this a dumb idea? Will this not work? What am I missing?

I’m too stupid to figure out how to add a modality to open flamingo so I was trying to naively figure out my own example.

Topic		Replies	Views
Adding linear layer to transformer model (+ save_pretrained and load_pretrained) 🤗Transformers	1	3718	March 10, 2022
Custom modification on transformers 🤗Transformers	1	156	June 13, 2024
How to add RNN layer on top of Huggingface BERT model 🤗Transformers	3	4524	January 19, 2021
Save custom transformer as PreTrainedModel Intermediate	1	929	September 7, 2021
PyTorchModelHubMixin Help Beginners	4	504	June 5, 2024

How do I add new separate layers to a pretained model to add a modality?

Related topics