How to add an RNN layer on top of a Hugging Face BERT model

I am working on a binary classification task and would like to try adding an RNN layer on top of the last hidden layer of a Hugging Face BERT PyTorch model. How can I extract the last hidden layer (layer -1) and connect it to an LSTM layer?

from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained(model_path)
# Load BertForSequenceClassification: the pretrained BERT model with a single
# linear classification layer on top.
model = BertForSequenceClassification.from_pretrained(model_path, num_labels=len(lab2ind))

You can use BertModel instead of BertForSequenceClassification:

https://huggingface.co/transformers/model_doc/bert.html#bertmodel

and feed its hidden-state output to an LSTM.
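A minimal sketch of that idea: wrap a `BertModel` and a fresh LSTM head in one `nn.Module`, take BERT's `last_hidden_state` (shape `(batch, seq_len, hidden_size)`), run it through the LSTM, and classify from the LSTM's final hidden state. The hidden sizes, the class name, and the two-label output here are illustrative assumptions, not part of the original thread; with recent `transformers` versions the model output exposes `.last_hidden_state` directly.

```python
import torch
import torch.nn as nn


class BertLSTMClassifier(nn.Module):
    """Sketch: BERT -> LSTM -> linear classifier for binary classification.

    `bert` is assumed to be a transformers BertModel, e.g.
        bert = BertModel.from_pretrained(model_path)
    """

    def __init__(self, bert, hidden_size=768, lstm_size=256, num_labels=2):
        super().__init__()
        self.bert = bert
        self.lstm = nn.LSTM(hidden_size, lstm_size, batch_first=True)
        self.classifier = nn.Linear(lstm_size, num_labels)

    def forward(self, input_ids, attention_mask=None):
        outputs = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        # last_hidden_state: (batch, seq_len, hidden_size) -- BERT's final layer
        sequence_output = outputs.last_hidden_state
        # h_n: (num_layers, batch, lstm_size); keep the last layer's final state
        _, (h_n, _) = self.lstm(sequence_output)
        return self.classifier(h_n[-1])  # logits: (batch, num_labels)
```

The same pattern works if you instead pool or average the LSTM outputs over the sequence; taking the final hidden state is just the simplest choice.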


Thank you for your reply. I am trying this now. But does adding an LSTM on top of BERT require training BERT from scratch?

No, you could use BertModel.from_pretrained and then add an LSTM :wink:
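To make the "no training from scratch" point concrete: `from_pretrained` loads BERT's pretrained weights, so only the new LSTM and classifier layers start untrained. You can even freeze BERT and train only the new head. A hedged sketch of the freezing pattern, using a plain `nn.Linear` as a hypothetical stand-in for the loaded BERT model so it runs without downloading weights:

```python
import torch
import torch.nn as nn

# Stand-in for: bert = BertModel.from_pretrained(model_path)
# (with the real library, pretrained weights are loaded here)
bert = nn.Linear(768, 768)

# Optionally freeze BERT so only the new LSTM head is updated.
for param in bert.parameters():
    param.requires_grad = False

lstm = nn.LSTM(768, 256, batch_first=True)

# Only parameters with requires_grad=True reach the optimizer.
trainable = [p for p in list(bert.parameters()) + list(lstm.parameters())
             if p.requires_grad]
optimizer = torch.optim.Adam(trainable, lr=1e-3)
```

Whether to freeze BERT or fine-tune it end-to-end is a design choice; either way, you are starting from the pretrained weights, not from scratch.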
