Understanding how to implement a custom BERT model

I am working on a text classification problem. Each text can consist of two or more sentences. My feeling is that I should obtain a BERT embedding for each sentence separately, then pass each embedding through a neural network to obtain per-sentence features, concatenate all of those features together, and finally classify with a binary classifier.

I am planning to do this in PyTorch. Based on the task at hand, my intuition is that obtaining a BERT embedding for each sentence separately will be more beneficial than encoding the whole text at once.
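To make the idea concrete, here is a rough sketch of the architecture I have in mind. All the module names and sizes are my own guesses, and I have replaced the BERT encoder with a plain Linear layer as a stand-in so the sketch runs without downloading weights; in the real model that would be BertModel.from_pretrained(...):

```python
import torch
import torch.nn as nn

class SentenceConcatClassifier(nn.Module):
    """Sketch only: encode each sentence, build per-sentence features,
    concatenate them, and classify with a binary head."""
    def __init__(self, num_sentences=2, hidden_size=768, feat_size=128):
        super().__init__()
        # Stand-in for a BERT encoder; in the real model this would be
        # transformers' BertModel.from_pretrained(...).
        self.encoder = nn.Linear(hidden_size, hidden_size)
        # Per-sentence feature extractor applied to each sentence embedding.
        self.per_sentence = nn.Linear(hidden_size, feat_size)
        # Binary classifier over the concatenated per-sentence features.
        self.classifier = nn.Linear(num_sentences * feat_size, 1)

    def forward(self, sentence_embeddings):
        # sentence_embeddings: list of (batch, hidden_size) tensors,
        # one tensor per sentence, processed one by one.
        feats = [torch.relu(self.per_sentence(self.encoder(e)))
                 for e in sentence_embeddings]
        concat = torch.cat(feats, dim=-1)  # (batch, num_sentences * feat_size)
        return torch.sigmoid(self.classifier(concat)).squeeze(-1)

model = SentenceConcatClassifier()
out = model([torch.randn(4, 768), torch.randn(4, 768)])
print(out.shape)  # torch.Size([4])
```

Is this roughly the right shape for such a model, or is there a more standard way to structure it?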

I have the following doubts:

  1. Is there any example that takes such an approach?
  2. Should my model inherit from PyTorch's nn.Module or from Hugging Face transformers' BertPreTrainedModel?
  3. How can I implement this concatenation logic: building the sub-networks in the __init__ method, and passing the input sentences through their corresponding networks one by one in the forward method?
  4. What is the line pooled_output = outputs[1] doing in models derived from BertPreTrainedModel? Is it obtaining a reference to the embedding corresponding to the [CLS] token?
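Regarding point 4, my current understanding is that the pooled output is not the raw [CLS] embedding itself but the [CLS] hidden state passed through an extra dense layer with a tanh activation (the model's "pooler"). Here is a minimal reimplementation of that idea as I understand it, to check whether I have the semantics right; MiniPooler is my own name for this sketch:

```python
import torch
import torch.nn as nn

class MiniPooler(nn.Module):
    """My understanding of what produces pooled_output: take the hidden
    state at the first position ([CLS]) and apply Linear + tanh."""
    def __init__(self, hidden_size=768):
        super().__init__()
        self.dense = nn.Linear(hidden_size, hidden_size)
        self.activation = nn.Tanh()

    def forward(self, hidden_states):
        # hidden_states: (batch, seq_len, hidden_size), i.e. the
        # per-token outputs of the last encoder layer.
        first_token = hidden_states[:, 0]  # the [CLS] position
        return self.activation(self.dense(first_token))

pooler = MiniPooler()
pooled = pooler(torch.randn(2, 16, 768))
print(pooled.shape)  # torch.Size([2, 768])
```

Is that what outputs[1] refers to, or does it index something else in the returned tuple?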