Hi everyone,
I have a dataset in which sentences have been segmented into words. How do I use it to train a BPE or SentencePiece tokenizer?
Thank you
Hi everyone,
I have a dataset in which sentences have been segmented into words. How do I use it to train a BPE or SentencePiece tokenizer?
Thank you