Training on a Domain-Specific Dataset

Hi,
I want to train a multi-label sentiment analysis classifier, and in addition to training the final output layer, I’d like to train the hidden BERT layers as well.
I want to understand how much improvement I can get in my metric (F1 score) by feeding it my domain-specific data.
All the documents/references I have seen so far only point to training the final output layer that generates the classification. Is there a way to train the various hidden layers of BERT using (let’s say) BERT Base?
Thanks in advance,
Devesh


My understanding is that if you don’t specifically freeze any of the layers, you will always train the whole model.
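For example, you can confirm this by checking the requires_grad flag on the model’s parameters (a minimal sketch, assuming the Hugging Face transformers library):

from transformers import BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained('bert-base-uncased')

# Every parameter is trainable by default, so a standard training loop
# updates the BERT encoder as well as the classification head.
assert all(p.requires_grad for p in model.parameters())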

If you want to train only particular layers, you can add a condition to this code:

from transformers import BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained('bert-base-uncased')

# Freeze the entire BERT encoder; only the classification head stays trainable.
for param in model.bert.parameters():
    param.requires_grad = False
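To confirm the freeze took effect, you can count the trainable parameters; only the classification head should remain (same assumptions as above):

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable: {trainable:,} / total: {total:,}")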

Hi @neuralpat,
Thanks so much for taking the time to respond to my queries. I’m going to try it out.
I am also wondering if there is a way to freeze specific layers (e.g., the top 3) and train the rest?

Hi,

model = BertForSequenceClassification.from_pretrained('bert-base-uncased')

for param in model.bert.parameters():
    param.requires_grad = False

As far as I understand, the code above freezes ALL the parameters, but you could easily limit it to the first 3 elements. model.bert.parameters() is a generator, so you can enumerate it:

# Freeze the first 3 parameter tensors and leave the rest trainable.
for i, el in enumerate(model.bert.parameters()):
    if i < 3:
        el.requires_grad = False
    else:
        el.requires_grad = True
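Note that each element yielded by the generator is a single weight or bias tensor, not a whole transformer layer, so freezing the first 3 elements does not freeze 3 layers. If you want to freeze entire encoder layers, you can index into model.bert.encoder.layer instead (a sketch, assuming the standard BertModel layout in transformers; use model.bert.encoder.layer[-3:] to target the top 3 layers rather than the bottom 3):

# Freeze the first 3 of the 12 encoder layers in bert-base.
for layer in model.bert.encoder.layer[:3]:
    for param in layer.parameters():
        param.requires_grad = False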

You can check the result with:

# Print each parameter’s name and whether it will be updated during training.
for name, param in model.named_parameters():
    print(name, param.requires_grad)