Hi,
I am following this tutorial: notebooks/language_modeling.ipynb at master · huggingface/notebooks · GitHub
However, I am wondering how to add extra characters to the tokenizer's vocabulary.
For example, accented characters such as é, à, è, ù, â, ê, î, ô, û, etc.
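From what I can tell, something like the sketch below might be the way to do it, but I am not sure it is the recommended approach. It assumes `tokenizer` and `model` are the Hugging Face objects loaded in the notebook, and it relies on `get_vocab()`, `add_tokens()` and `resize_token_embeddings()` from `transformers`. The helper name `add_missing_chars` is my own, not something from the tutorial.

```python
def add_missing_chars(tokenizer, model, chars):
    """Add characters that are not vocabulary entries yet, then resize the model.

    Hypothetical helper: `tokenizer` and `model` are assumed to behave like a
    transformers tokenizer and model.
    """
    vocab = tokenizer.get_vocab()  # dict mapping token string -> token id
    # Keep only the characters that are not already stored as literal tokens.
    missing = [c for c in chars if c not in vocab]
    # add_tokens returns the number of tokens that were actually added.
    num_added = tokenizer.add_tokens(missing)
    if num_added > 0:
        # The embedding matrix needs one row per token in the enlarged vocabulary.
        model.resize_token_embeddings(len(tokenizer))
    return num_added
```

I would then call it as `add_missing_chars(tokenizer, model, ["é", "à", "è", "ù", "â", "ê", "î", "ô", "û"])` before training. One thing I am unsure about: if the checkpoint uses a byte-level BPE tokenizer, it may already be able to encode these characters without storing them as literal vocabulary entries, in which case this step might not be needed at all.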
Thanks.