Fine-tuning on a new corpus for conditional generation. Should I train from scratch?

Hello guys,

I’m trying to figure out the best strategy to pursue. The scenario is the following:

  1. I’d like to use FLAN-T5 as the base model
  2. I have a corpus in Brazilian Portuguese, so I want to fine-tune the original model (for general conditional generation, not for a specific downstream task; that will come later).

What is the best way to proceed? Should I train a custom tokenizer and start from scratch, or should I leverage everything already in place (model and tokenizer) and just run a few epochs to adapt the model to my language?
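One rough signal for the tokenizer question is "fertility": how many subword pieces the existing tokenizer produces per Portuguese word. If the pretrained vocabulary covers the language reasonably well, fertility stays low and keeping the original tokenizer is attractive. This is only a sketch; `toy_tokenize` below is a hypothetical stand-in so it runs offline, and in practice you would load the real tokenizer with `AutoTokenizer.from_pretrained("google/flan-t5-base")` from the `transformers` library and pass its `tokenize` method instead.

```python
# Sketch: estimate tokenizer "fertility" (subword pieces per word) on a
# Portuguese sample. Very high fertility suggests the vocabulary covers
# the language poorly, which is an argument for a custom tokenizer.
#
# Hypothetical stand-in tokenizer so the sketch runs offline; with
# transformers installed you would instead do:
#   from transformers import AutoTokenizer
#   tok = AutoTokenizer.from_pretrained("google/flan-t5-base")
#   print(fertility(sample, tok.tokenize))

def toy_tokenize(text: str) -> list:
    # Stand-in: splits each whitespace word into 2-character pieces.
    pieces = []
    for word in text.split():
        pieces.extend(word[i:i + 2] for i in range(0, len(word), 2))
    return pieces

def fertility(text: str, tokenize) -> float:
    # Average number of tokenizer pieces per whitespace-separated word.
    words = text.split()
    return len(tokenize(text)) / len(words)

sample = "O tokenizador original pode fragmentar demais palavras em português."
print(f"pieces per word: {fertility(sample, toy_tokenize):.2f}")
```

Comparing this number between the original FLAN-T5 tokenizer and a SentencePiece model trained on your corpus would make the trade-off concrete before committing to either path.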

I understand that if I start from scratch I lose all the knowledge stored in the pretrained model, but if that is the right way to specialize a model for a new language, that's fine. Your thoughts will be very much appreciated.

Thank you so much for this.