I want to run some experiments using data from the Pile, but I don't have nearly enough disk space for the full dataset. Is there an easy way to download only a small portion of it?