Processing a Large Dataset for Training a GPT-2 Model

Hi! Have you tried increasing preprocessing_num_workers?
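In case it helps, here is a rough sketch of one way to pick a value. It assumes you are running one of the Hugging Face example scripts such as run_clm.py (the thread does not say which script is in use), where preprocessing_num_workers is passed as num_proc to Dataset.map and so controls how many processes tokenize in parallel. The helper names, the one-core reserve, and the optional cap are hypothetical choices for illustration, not part of any library.

```python
import os


def suggest_preprocessing_workers(reserve=1, cap=None, cpu_count=None):
    """Hypothetical helper: choose a worker count from the available CPU cores.

    reserve   -- cores to leave free for the main process and the OS (assumption)
    cap       -- optional upper bound, e.g. to limit memory use (assumption)
    cpu_count -- override for the detected core count, mainly for testing
    """
    cpus = cpu_count if cpu_count is not None else (os.cpu_count() or 1)
    workers = max(1, cpus - reserve)  # never go below one worker
    if cap is not None:
        workers = max(1, min(workers, cap))
    return workers


def build_cli_args(num_workers):
    """Format the value as command-line arguments for the training script."""
    return ["--preprocessing_num_workers", str(num_workers)]


if __name__ == "__main__":
    n = suggest_preprocessing_workers()
    # Append the printed arguments to your usual training command.
    print(" ".join(build_cli_args(n)))
```

Each extra worker is a separate process holding its own batches, so memory use tends to grow with the worker count. If the machine starts swapping, a lower value (or a cap as above) may end up faster.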