I have a conceptual question about BERT training. When BERT is pre-trained, is the so-called pooling layer already used during pre-training?
At the moment it seems to me that the pooling layer is only relevant for sequence classification, but I could be wrong. However, in the code it is set to true by default. What exactly is the purpose of this pooling layer?
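For context, here is roughly what I understand the pooling layer to be doing, as a PyTorch-style sketch (the class name and shapes here are my own illustration, not copied from the library): it takes the hidden state of the first token (`[CLS]`) and passes it through a dense layer with a tanh activation, producing one fixed-size vector per sequence.

```python
import torch
import torch.nn as nn

class Pooler(nn.Module):
    """Sketch of a BERT-style pooler: project the [CLS] hidden state."""

    def __init__(self, hidden_size: int):
        super().__init__()
        self.dense = nn.Linear(hidden_size, hidden_size)
        self.activation = nn.Tanh()

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, seq_len, hidden) -> take the first token only
        first_token = hidden_states[:, 0]
        return self.activation(self.dense(first_token))

# Toy example: batch of 2 sequences, length 8, hidden size 16
hidden = torch.randn(2, 8, 16)
pooled = Pooler(16)(hidden)
print(pooled.shape)  # torch.Size([2, 16])
```

So my understanding is that this collapses the per-token outputs into a single sequence-level vector, which is why it looks classification-specific to me.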