But if split by node, the trainer should not skip examples?
So, how do you implement fast processing in DDP?