Loading finetuned model to generate text

rgwatwormhill · October 19, 2020, 9:03am

what learning-rate did you use for the second lot of training?

Were you using similar training data or very different training data?

Did you save the optimizer state_dict as well as the model state_dict?

If you load your saved (fine-tuned) model, and do a validation check before you start any more training, what kind of validation accuracy do you get?

Topic		Replies	Views
Generate method during finetuning Beginners	6	1941	July 30, 2020
Fine-tune, or train from scratch? Beginners	6	3454	September 16, 2020
How to load finetuned model in TF Beginners	2	450	September 28, 2020
Need help with gpt2 model Beginners	0	585	July 9, 2023
Finetuning GPT2 with user defined loss Beginners	56	16089	July 23, 2023