Hi, I’m curious about whether the dropout is activated when we call model.generate()?
For example, if the model is a T5 model loaded by
model = T5ForConditionalGeneration.from_pretrained('google-t5/t5-base')
By default, the dropout layers in T5 model has p=0.1. I wonder if anyone knows whether the dropout layers are activated when we use model.generate() to generate outputs?
Or maybe model.eval() is called somewhere inside model.generate()?