I’m trying to train T5 with unsupervised denoising training method.
I found TextDataset and LineByLineTextDataset in Transformers library while I looking for a dataset to use.
Which one is better in terms of accuracy? or which one is more commonly used?
I think LineByLineTextDataset
will be deprecated soon