Hello,
It is possible to do lower case to given input ids without decode and then encode again ?
for example
tokenizer = AutoTokenizer.from_pretrained('roberta-base')
text1 = tokenizer.decode([713, 16, 10, 3645, 4])
print(text1)
>>> This is a sentence.
text2 = tokenizer.decode([9226, 16, 10, 3645, 4])
print(text2)
>>> this is a sentence.
I would like to know if there is some fast way to convert the id 713 to 9226, without decode, do lower and then encode again.
Thanks,
Shon