I just read up on this: padding is the job of the DataCollator, which pads the labels with -100 so that the loss function ignores those positions. Is this correct?
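
To make the question concrete, here is a minimal pure-Python sketch of the behavior being described. It is not the library's implementation: `pad_batch` and `masked_cross_entropy` are hypothetical helpers written for illustration, `pad_token_id=0` is an assumed value, and the labels are assumed to have the same length as `input_ids`. The only outside fact it relies on is that -100 is the default `ignore_index` of PyTorch's `CrossEntropyLoss`, which is why collators such as `DataCollatorForSeq2Seq` use it as their default `label_pad_token_id`.

```python
import math

IGNORE_INDEX = -100  # matches the default ignore_index of torch.nn.CrossEntropyLoss


def pad_batch(features, pad_token_id=0, label_pad_id=IGNORE_INDEX):
    """Hypothetical helper: right-pad each example to the longest one in the batch."""
    max_len = max(len(f["input_ids"]) for f in features)
    batch = {"input_ids": [], "attention_mask": [], "labels": []}
    for f in features:
        n_real = len(f["input_ids"])
        n_pad = max_len - n_real
        # Inputs are padded with the tokenizer's pad id (assumed 0 here).
        batch["input_ids"].append(f["input_ids"] + [pad_token_id] * n_pad)
        # The attention mask marks real tokens with 1 and padding with 0.
        batch["attention_mask"].append([1] * n_real + [0] * n_pad)
        # Labels are padded with -100 so the loss can skip those positions.
        batch["labels"].append(f["labels"] + [label_pad_id] * n_pad)
    return batch


def masked_cross_entropy(logits, labels, ignore_index=IGNORE_INDEX):
    """Mean cross-entropy over positions whose label is not ignore_index."""
    total, count = 0.0, 0
    for row, label in zip(logits, labels):
        if label == ignore_index:
            continue  # padded position: contributes nothing to the loss
        log_z = math.log(sum(math.exp(x) for x in row))
        total += log_z - row[label]
        count += 1
    return total / count if count else 0.0


features = [
    {"input_ids": [5, 6, 7], "labels": [1, 0, 2]},
    {"input_ids": [8], "labels": [2]},
]
batch = pad_batch(features)
print(batch["labels"])  # [[1, 0, 2], [2, -100, -100]]
```

Note that the inputs and the labels are padded with different values: the inputs get the pad token id (and a 0 in the attention mask), while only the labels get -100. Whatever the model predicts at the padded positions has no effect on the loss.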