https://stackoverflow.com/questions/76191862/how-can-i-fine-tune-mbart-50-for-machine-translation-in-the-transformers-python
I found a sample, but it seems that the only way to switch between multiple tokenizers is to use them simultaneously…
There seem to be several methods, such as overriding __getitem__
or passing pre-tokenized data, but…