Padding in datasets

@maximin what was your solution in place of lambda entry: self.tokenizer(entry[ padding=True,)?