Does T5Tokenizer support the Greek language?
When I run the 3 lines of code below, then the input_ids are just 2 and 3 which correspond to the unknown token and the underscore respectively. This is the same for any input text of Greek letters.
from transformers import T5Tokenizer
tokenizer = T5Tokenizer.from_pretrained(āt5-smallā)
input_ids = tokenizer(āĪειά ĻĪæĻ
ĪĻĻμεā, return_tensors=āptā).input_ids