Doing classification 100% from scratch?

olaffson · September 17, 2021, 4:25pm

@nielsr just a follow up if you have a moment. The TF notebook for language modeling actually mention two different tasks: causal language modeling and masked language modeling.

For the purpose of training a classifier on top of the model I train from scratch, are the two basic tasks equivalent? That is I can train a causal language modeling and then train a classifier with it or I can train a masked language model and then train the classifier. Are both approaches OK conceptually?

Thanks!

Topic		Replies	Views
Training a language model from scratch with tensorflow (not pytorch)? Intermediate	4	852	August 9, 2021
Further pre-train language model in transformers like BERT Models	3	1108	March 27, 2022
SpanBERT, ELECTRA, MARGE from scratch? Beginners	5	1371	July 22, 2023
Saving underlying language model after trained on downstream task 🤗Transformers	0	420	September 14, 2020
Training BERT from scratch with Wikipedia + Book Corpus Dataset 🤗Transformers	1	4624	January 22, 2021

Doing classification 100% from scratch?

Related topics