If I am not mistaken, there are two types of trainers in the library: the standard Trainer and the Seq2SeqTrainer.
It seems that Trainer works for every model, since I am using it with a Seq2Seq model (T5).
My question is: what advantages does Seq2SeqTrainer have over the standard one?
And why doesn't the library handle the switch in the background, or does it?
What I mean is that the user could always use Trainer, and in the background the library would use a Seq2SeqTrainer whenever the model needs it.
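
To make the idea concrete, here is a rough sketch of the kind of automatic switch I have in mind. The helpers `pick_trainer_name` and `pick_trainer_cls` are hypothetical names of my own, not something the library provides. I am also assuming that the `is_encoder_decoder` flag on the model config is a reasonable signal for "this model needs Seq2SeqTrainer", which may not be the right criterion.

```python
from types import SimpleNamespace


def pick_trainer_name(config) -> str:
    """Hypothetical helper: name of the trainer class to use for a model config."""
    # Assumption: encoder-decoder models (e.g. T5) are the ones that
    # would want Seq2SeqTrainer; everything else gets the standard Trainer.
    if getattr(config, "is_encoder_decoder", False):
        return "Seq2SeqTrainer"
    return "Trainer"


def pick_trainer_cls(model):
    """Hypothetical helper: resolve the trainer class for a loaded model."""
    # Imported lazily so pick_trainer_name can be tried without transformers installed.
    import transformers

    return getattr(transformers, pick_trainer_name(model.config))


# Stand-in configs for illustration only; a real model exposes model.config.
t5_like = SimpleNamespace(is_encoder_decoder=True)
bert_like = SimpleNamespace(is_encoder_decoder=False)

print(pick_trainer_name(t5_like))
print(pick_trainer_name(bert_like))
```

With a real model, the usage would be something like `trainer_cls = pick_trainer_cls(model)` followed by `trainer_cls(model=model, args=args, ...)`. Is there a reason the library does not do something like this itself?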