Resume Training / Finetune a language model and further finetune a classifier

pentagroom · October 19, 2020, 12:51am

Hi,
I would like to finetune a powerful classifier based on a pre-trained language model. As we know, the typical approach is to fine-tune a classifier using a pre-trained model. What I am wondering is that, if I fine-tune a pre-trained model based on a fine-tune language model settings using DS1(typical text dataset) (OR resume training from the last checkpoint) and then further fine-tune this newly fine-tuned model using another DS2(typical text dataset) for a classifier purpose, would this be a redundant effort as compared to a pipeline which is to just finetune a pre-trained model using DS2? I would like to receive your thoughts.

Thank you.

Jung · October 19, 2020, 3:15pm

Hi, there are papers indeed indicate that “multi-steps” finetuning is helpful. See this paper for one example .

Topic		Replies	Views
Separate LM fine tuning and classification head training Beginners	5	1860	July 1, 2021
Continue Pre-Training Roberta Intermediate	3	2689	May 18, 2023
Fine-tune, or train from scratch? Beginners	6	3454	September 16, 2020
Dataset parameters to finetune a pretrained translation model on new vocabulary Models	0	365	July 5, 2023
Custom tokenizer: finetune model or retrain model? 🤗Transformers	1	918	March 8, 2024

Resume Training / Finetune a language model and further finetune a classifier

Related topics