Hi, I want to do some language model pre-training, using the Trainer API.
Assume I have two datasets wikitext
and bookcorpus
. I want to first train on wikitext
and then on bookcorpus
, and I want to save the checkpoint after training on wikitext
, then continue training on bookcorpus
and save the later checkpoints.
I wish to have the checkpoints something like this:
checkpoint-500 (only wikitext)
checkpoint-1000 (only wikitext)
checkpoint-1500 (only wikitext)
checkpoint-1800 (finished training on wikitext)
checkpoint-2300 (continue training on bookcorpus)
...
I don’t want to mix the two datasets together, because I want to analyse what’s the difference after training on another dataset. I want to know how to achieve this?
Could anyone help me?