Chapter 2 questions

dberry1023 · March 29, 2024, 5:35pm

How do I know which tokenizer to choose?
Example 1.
"The dog’s ran into the church."
Model 1: [The, dog’s, ran, into, the, church]
Model 2: [The, dog, 's, ran, into, the, church]

These give a model two different inputs for the same sentence. How do I know whether to choose a tokenizer that stores the whole word or one that breaks a word into its parts?
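
To make the two splits concrete, here is a minimal sketch in plain Python that reproduces both token lists. The function names `whole_word_tokenize` and `clitic_tokenize` are hypothetical and written only for this illustration; they do not correspond to any real model's tokenizer, and the example uses a straight apostrophe for simplicity.

```python
import re

text = "The dog's ran into the church."

def whole_word_tokenize(s):
    # Hypothetical "model 1" behaviour: split on whitespace and
    # strip surrounding punctuation, so "dog's" stays one token.
    return [w.strip(".,!?") for w in s.split()]

def clitic_tokenize(s):
    # Hypothetical "model 2" behaviour: emit "'s" as its own token,
    # and otherwise keep runs of word characters together.
    return re.findall(r"[’']s\b|\w+", s)

model_1 = whole_word_tokenize(text)
model_2 = clitic_tokenize(text)

print(model_1)
print(model_2)
```

With a pretrained model this is usually not a free choice: the course loads the tokenizer from the same checkpoint as the model, because the model was trained on that tokenizer's output.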
