This is a beginner question but
How does huggingface infer what type of method to use for tokenizing the sentence? Can you point me where the implementation could be?
Thanks
1 Like
Maybe simply written in a json file in the same folder.
not really, i couldnt find it
Oh, you’re looking for code. The following is called based on the YAML set in the model repo and the json as above. in Llama’s case, this is it.