The DataCollator
can help if you have something randomized in the call that returns the batch. A getitem
in your Dataset can also help, it all depends on what you are trying to do exactly.
The Trainer
in itself has nothing implemented for data augmentation, so it won’t help you.