What is the difference between forward() and generate()?

Hi!

It seems like some models implement both functions, and semantically they behave similarly, though they might be implemented differently. What is the difference? In both cases, given an input sequence, the model produces a prediction (inference)?

Thank you,

wilornel

Hi,

  • forward() can be used both for training and inference. Forward refers to a single forward pass through the network. During training, we apply a forward pass to get the model’s predictions, and then do a backward pass to compute the gradients of the loss with respect to the parameters, which we then use to update those parameters. We then do another forward pass, followed by another backward pass, etc. This is typically done on batches of data.
  • generate() can only be used at inference time, and uses forward() behind the scenes, in a sequence of time steps (see this post for a simple showcase of that). The first forward pass predicts the first new token; next we append the predicted token to the input of the next time step, which again uses forward() to predict the next token, and so on. This is called autoregressive generation. There are decoding strategies to decide which token to take as the next prediction, such as beam search, top-k sampling, and so on (a detailed blog post can be found here).
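To make the second bullet concrete, here is a minimal sketch of autoregressive generation. The model and its forward() are a hypothetical toy stand-in (a rule that favors the token after the last one, not a real neural network); the point is only the loop structure, where generate() calls forward() once per new token:

```python
VOCAB_SIZE = 4

def forward(input_ids):
    # Toy stand-in for a model's forward pass: returns logits over the
    # vocabulary, strongly favoring (last_token + 1) % VOCAB_SIZE.
    last = input_ids[-1]
    return [3.0 if tok == (last + 1) % VOCAB_SIZE else 0.0
            for tok in range(VOCAB_SIZE)]

def generate(input_ids, max_new_tokens):
    # generate() is just forward() in a loop: predict one token,
    # append it, and feed the longer sequence back in (greedy decoding).
    ids = list(input_ids)
    for _ in range(max_new_tokens):
        logits = forward(ids)
        next_token = max(range(VOCAB_SIZE), key=lambda t: logits[t])
        ids.append(next_token)
    return ids

print(generate([0], max_new_tokens=3))  # → [0, 1, 2, 3]
```

Each iteration is one forward pass, which is why generation cost grows with the number of tokens produced.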
5 Likes

@nielsr Can you provide any insight as to why one would prefer to use one over the other.

For example, I am realizing that with generate() we are not able to obtain the model loss at inference time. If I can also generate the text sequence using forward(), I’d rather just use that. I feel there is something else at play, though.
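On the loss point: a loss can be computed from forward() alone, because forward() returns logits at every step. A sketch of per-token cross-entropy with teacher forcing (the forward() here is a hypothetical toy model, not a real one, and sequence_loss is an illustrative helper name):

```python
import math

VOCAB_SIZE = 4

def forward(input_ids):
    # Toy stand-in for a model's forward pass: logits over the vocab,
    # strongly favoring (last_token + 1) % VOCAB_SIZE.
    last = input_ids[-1]
    return [3.0 if tok == (last + 1) % VOCAB_SIZE else 0.0
            for tok in range(VOCAB_SIZE)]

def sequence_loss(input_ids, target_ids):
    # Average cross-entropy of each target token under forward()'s
    # logits, with teacher forcing: feed the true prefix at every step.
    total = 0.0
    prefix = list(input_ids)
    for target in target_ids:
        logits = forward(prefix)
        log_norm = math.log(sum(math.exp(l) for l in logits))
        total += log_norm - logits[target]  # -log p(target | prefix)
        prefix.append(target)
    return total / len(target_ids)

# Targets the toy model "expects" score a much lower loss than others.
print(sequence_loss([0], [1, 2, 3]))  # small (≈ 0.14)
print(sequence_loss([0], [2, 2, 2]))  # large (≈ 3.14)
```

generate() discards the logits after picking each token, which is why it does not hand back a loss; forward() gives you the raw scores to compute one yourself.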

1 Like

The generate method is more feature-complete, supporting various fancier decoding methods besides greedy decoding, such as beam search and top-k sampling.
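As a rough illustration of why those decoding strategies matter, here is a self-contained sketch of greedy decoding versus top-k sampling over a single vector of logits (the logits values are made up for the example):

```python
import math
import random

def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    s = sum(exps)
    return [e / s for e in exps]

def greedy(logits):
    # Greedy decoding: always take the single most likely token.
    return max(range(len(logits)), key=lambda t: logits[t])

def top_k_sample(logits, k, rng):
    # Top-k sampling: keep only the k highest-scoring tokens,
    # renormalize their probabilities, and sample from that subset.
    top = sorted(range(len(logits)), key=lambda t: logits[t], reverse=True)[:k]
    probs = softmax([logits[t] for t in top])
    return rng.choices(top, weights=probs, k=1)[0]

logits = [0.1, 2.0, 1.5, -1.0, 0.3]
rng = random.Random(0)
print(greedy(logits))  # → 1
# Samples are drawn only from the top-2 tokens {1, 2}:
print({top_k_sample(logits, k=2, rng=rng) for _ in range(50)})
```

Greedy decoding is deterministic, while top-k trades some likelihood for diversity; beam search instead keeps several candidate sequences alive and picks the best-scoring one overall.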