Hello everyone, I would like to ask for help with the following:
I want to fine-tuning of a language model for text generation, for this I will use run_lm_fine_tuning.py
I would like to know what is the optimal way to train the model. Can it be trained with a pure csv file, or should the file have some kind of pre-processing?
Or is it necessary to train it with an instance of the Dataset class ?