How to modify the internal layers of BERT

Jung · November 14, 2020, 2:25am

Hi, I am not a good coder on Pytorch, but I can give some rough ideas to fix the error.

First, if I understand your objective correctly, you should extract the pretrained embedding output (not redefine it with FC_Embeddings like you do). So you should send your input to Bert’s pretrained embedding layer. (send input_ids to get the embedded output, let named it x.)

Secondly, only here, that you can use your kwargs['fc_idxs'] to do what you want with x to get your designed output, let simply named it, y.

Then, after this point, you can send y to self.bert's upper layers (not include embdding layer) but not send kwargs['fc_idxs'] to self.bert since it doesn’t know this parameter.

NOTE: to send the embedded vector to self.bert's upper layers, you need to input inputs_embeds instead of input_ids.

Please see the manual for reference on inputs_embeds vs. input_ids :

And you can see my Tensorflow example doing exactly like this.

Topic		Replies	Views
Modify bert embeddings 🤗Transformers	0	387	January 18, 2022
New layer in bert embeddings 🤗Transformers	1	707	April 1, 2022
Modify BERT encoder layers? 🤗Transformers	0	1034	June 18, 2021
New Layer in BERT 🤗Transformers	0	210	September 25, 2022
How to add a new input layer to BERT / RoBERTa? Beginners	0	921	April 26, 2022

How to modify the internal layers of BERT

Related topics