Map with tokenize function gets stuck at the beginning

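Maybe try an IterableDataset? Its map is lazy, so it starts yielding examples right away instead of processing the whole dataset up front.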