Add data augmentation process during training every epoch

The DataCollator can help if you have something randomized in the call that returns the batch. A getitem in your Dataset can also help, it all depends on what you are trying to do exactly.

The Trainer in itself has nothing implemented for data augmentation, so it won’t help you.

2 Likes