I’m a little confused about the purpose of predict_with_generate. My training losses are always better when I set this to true but I’m under the impression that it’s not supposed to affect training at all.

Also, if I do not specify a compute_metrics function in the Seq2SeqTrainer but keep predict_with_generate set to True, are the training/validation losses computed differently?