As I understand it, during inference, when I input “The quick brown fox,” the model predicts the next word after “The,” then after “The quick,” and so on. Why does it predict tokens that are already in the input? Why doesn’t it start predicting directly after the entire input, i.e. after “The quick brown fox”? If the model predicts a word like “tree” after “The quick brown,” do we continue with “The quick brown tree”? If not, why do we spend computational resources on these predictions?
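To make my confusion concrete, here is how I picture greedy decoding, using a toy stand-in for a causal LM (the fake `forward` function, the vocabulary size, and the shapes are all my own assumptions, not a real model). In this picture, one forward pass produces logits at *every* position, yet only the last row is ever used to pick the next token:

```python
import numpy as np

VOCAB = 10  # toy vocabulary size (my assumption)

def forward(tokens):
    """Toy stand-in for a causal LM: one forward pass returns logits
    for EVERY position -- row i is a prediction for the token that
    follows tokens[:i+1]. Here the 'prediction' is just token+1 so
    the example is deterministic."""
    return np.stack([np.roll(np.eye(VOCAB)[t], 1) for t in tokens])

def generate(prompt, n_new):
    tokens = list(prompt)
    for _ in range(n_new):
        logits = forward(tokens)                 # shape (len(tokens), VOCAB)
        next_token = int(np.argmax(logits[-1]))  # only the LAST row is used
        tokens.append(next_token)                # earlier rows are ignored
    return tokens

print(generate([3, 1, 4], 2))  # → [3, 1, 4, 5, 6]
```

So in this sketch the model appends only `argmax(logits[-1])` and the predictions at earlier positions (like “tree” after “The quick brown”) never change the input. Is that the right picture, and if so, why are those earlier rows computed at all?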
I’m really struggling with this question, and your help would be greatly appreciated!