Chapter 2 questions

:pencil2: Try it out! Replicate the last two steps (tokenization and conversion to input IDs) on the input sentences we used in section 2 (“I’ve been waiting for a HuggingFace course my whole life.” and “I hate this so much!”). Check that you get the same input IDs we got earlier!
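
Here is a minimal sketch of those two steps, assuming the bert-base-cased checkpoint used in section 2 of the chapter:

from transformers import AutoTokenizer

# Assumption: bert-base-cased is the checkpoint used earlier in the chapter
tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")

sequences = ["I’ve been waiting for a HuggingFace course my whole life.", "I hate this so much!"]
for sequence in sequences:
    tokens = tokenizer.tokenize(sequence)          # step 1: tokenization
    print(tokens)
    ids = tokenizer.convert_tokens_to_ids(tokens)  # step 2: conversion to input IDs
    print(ids)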

decoded_string = tokenizer.decode([101, 146, 112, 1396, 1151, 2613, 1111, 170, 20164, 10932, 2271, 7954, 1736, 1139, 2006, 1297, 119, 102])
print(decoded_string)

[CLS] I’ve been waiting for a HuggingFace course my whole life. [SEP]

decoded_string = tokenizer.decode([146, 112, 1396, 1151, 2613, 1111, 170, 20164, 10932, 2271, 7954, 1736, 1139, 2006, 1297, 119])
print(decoded_string)

I’ve been waiting for a HuggingFace course my whole life.

What’s the difference between these two outputs?