Intermediate Fine-tuning vs Domain Adaptive Pretraining vs Task Adaptive Pretraining

Ghada1997 · December 8, 2023, 12:22am

I want to apply transfer learning to classify citizen reviews about government services into their relevant government sectors.

I have around 4K labeled citizen reviews as target data and around 30K newspaper articles that are labeled with labels similar to the target data, like healthcare, education, etc. In this case I have four options and I want an advice about the best approach to follow

Apply intermediate fine tuning using a BERT model on the newspaper articles before fine tuning on the target data?
Apply domain adaptive pretraining on the newspaper articles without the labels through further pretraining BERT model, then fine tune the model on the target data?
Apply Task adaptive pretraining on a portion of the citizen reviews without the labels through further pretraining BERT. Then, fine tune the model using another portion of the target data with its labels.
Combine approaches 2 and 3.

P.S. I will compare the chosen approach to direct fine tuning BERT on the target data.

Topic		Replies	Views
Domain Specific Pretraining using BERT models vs other smaller architecture models 🤗Transformers	0	210	December 7, 2023
Pretraining Models from Scratch vs Further Training 🤗Transformers	0	269	November 28, 2023
Suitable Data for Task Adaptive Pretraining (TAPT) 🤗Transformers	0	194	December 4, 2023
Using EXTREMELY small dataset to finetune BERT 🤗Transformers	6	13102	February 1, 2023
Multi-class Classification Basics Beginners	4	4546	August 24, 2021

Intermediate Fine-tuning vs Domain Adaptive Pretraining vs Task Adaptive Pretraining

Related topics