After building a custom Tokenizer, if I later get more data to train on, should I call train() or train_from_iterator() again on the same saved tokenizer? Will the originally trained tokenizer be overwritten?