Having fine-tuning as well as pre-training together as multi-task

Hi

I am interested in having an MLM task as well as a classification task in a single setup.

Any leads?

I think it would be simpler to do MLM first and then classification. Is there any reason why you need to define the model with both heads at once?

It is certainly possible (in native PyTorch or native TensorFlow) to define two different pathways through a model, for example one shared encoder with an MLM head and a classification head on top.
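Something along these lines is one way to sketch it in PyTorch with a shared Hugging Face encoder; the model name, head sizes, and the `task` flag are just placeholders for illustration, not a definitive recipe:

```python
import torch.nn as nn
from transformers import AutoModel

class MultiTaskModel(nn.Module):
    """Shared encoder with two pathways: an MLM head and a classification head."""

    def __init__(self, model_name="bert-base-uncased", num_labels=2):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(model_name)
        hidden = self.encoder.config.hidden_size
        vocab = self.encoder.config.vocab_size
        self.mlm_head = nn.Linear(hidden, vocab)       # token-level predictions
        self.cls_head = nn.Linear(hidden, num_labels)  # sequence-level predictions

    def forward(self, input_ids, attention_mask, task="cls"):
        hidden_states = self.encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state
        if task == "mlm":
            # (batch, seq_len, vocab) logits for masked-token prediction
            return self.mlm_head(hidden_states)
        # Use the first ([CLS]) token representation for classification
        return self.cls_head(hidden_states[:, 0])
```

During training you could then alternate batches between the two tasks (or compute both losses on the same batch and sum them, possibly with a weighting factor) and backpropagate through the shared encoder.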

When you say “pre-training”, do you mean that you want to train a model from scratch, or are you going to start with a pre-trained model and then do multiple further training steps?

Sorry for using the ambiguous term. I meant using the existing weights from an already-trained model.