Add data augmentation process during training every epoch

Hi @sgugger

Thank you for the answer!
I now understand for my purpose I should treat a DataCollator or getitem in a Dataset, depending on what I am trying to do, and the Trainer is not an appropriate place in which data augmentation is done.

Referring to your answer in my another question How to use Seq2SeqTrainer (Seq2SeqDataCollator) in v4.2.1 - #5 by sgugger,
I’ll consider the best way to implement to get something randomized in my case.

Thank you so much!

2 Likes