DistilBERT multiclass classification example

def forward(self, input_ids, attention_mask):
    # Run DistilBERT; output_1[0] is the same tensor as
    # output_1.last_hidden_state, shape (batch_size, seq_len, hidden_dim)
    output_1 = self.l1(input_ids=input_ids, attention_mask=attention_mask)
    hidden_state = output_1[0]
    # Keep only the first token's hidden vector for each sequence:
    # shape (batch_size, hidden_dim)
    pooler = hidden_state[:, 0]
    logits = self.classifier(pooler)
    return logits

Why is the slicing hidden_state[:, 0] needed, and what does it signify? I'm unable to understand this step.
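A toy sketch of what that slice does, using NumPy in place of the actual PyTorch tensors (the shapes batch_size=2, seq_len=4, hidden_dim=3 are made-up illustration values): last_hidden_state holds one hidden vector per token, and [:, 0] selects only the vector at position 0 of each sequence, which for BERT-style models is the special [CLS] token conventionally used as a whole-sequence summary for classification.

```python
import numpy as np

batch_size, seq_len, hidden_dim = 2, 4, 3  # toy values, not DistilBERT's real dims

# Stand-in for last_hidden_state: one hidden vector per token,
# shape (batch_size, seq_len, hidden_dim)
hidden_state = np.arange(batch_size * seq_len * hidden_dim, dtype=np.float32)
hidden_state = hidden_state.reshape(batch_size, seq_len, hidden_dim)

# [:, 0] keeps every sequence in the batch but only its first token
# (the [CLS] position), dropping the seq_len axis
pooler = hidden_state[:, 0]

print(hidden_state.shape)  # (2, 4, 3)
print(pooler.shape)        # (2, 3)
```

So the classifier head receives one fixed-size vector per example rather than one per token, which is what a per-sequence (multiclass) prediction requires.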