Non shuffle training

Refer to thread here: How to ensure the dataset is shuffled for each epoch using Trainer and Datasets? - #3 by lhoestq