Extracting the output of hidden BERT layers and re-training the BERT model on custom datasets

devesh1412 · March 17, 2021, 11:53am

Hi All,
I am trying to create a multi-label sentiment analysis classifier(number of classes = 28) and my goal is to:

1.Train the various BERT layers for my specific task.( using a pre_trained tokenizer on BERT_Base or DISTILLBERT )
2. Conduct experiments by extracting the output of hidden BERT layers and combine (adding/averaging) it with the output (‘CLS’) and compare the metrics.

My questions are:

How do I re-train a transformer model ? I do not want to use the pre-trained weights.
How do I extract the output of a hidden transformer block to combine with ‘CLS’ to generate a prediction?

Appreciate any help and pointers!
Thanks,
Devesh

Topic		Replies	Views
Fine tuning bert with tensorflow huggingface transformers 🤗Transformers	1	201	January 28, 2024
Extract final hidden unit scores after custom fine-tuning language model 🤗Transformers	0	210	July 15, 2022
Do we need to load a model twice to get embeddings and probabilities? 🤗Transformers	3	1447	December 18, 2021
How to get word embedding from a TF bert model? 🤗Transformers	0	338	October 1, 2021
DistilBERT multiclass classification example 🤗Transformers	0	288	May 22, 2023

Extracting the output of hidden BERT layers and re-training the BERT model on custom datasets

Related topics