Hello everyone,
I downloaded a few datasets and quickly ran out of disk space…
I was wondering whether I still need the downloaded files for later use of the datasets, or whether I can delete their contents to free up some space.
My datasets directory layout on disk is as follows:
I am asking specifically about the ./downloads directory, which consists of files such as:
Edit:
I’m mostly running the run_mlm.py script using those downloaded datasets.
If I use the --dataset_cache_directory flag and point it at the directory of the dataset in my files (which is located under my .cache directory), do I still need the contents of the downloads directory?
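For context, here is a minimal sketch of how the space used by each subdirectory of the cache could be measured before deciding what to delete. The cache location ~/.cache/huggingface/datasets and the helper names dir_size_bytes and subdir_sizes are assumptions for illustration, not something taken from the library. The script only reads file sizes and deletes nothing.

```python
import os
from pathlib import Path


def dir_size_bytes(root: Path) -> int:
    """Sum the sizes of all regular files under root (symlinked files are skipped)."""
    total = 0
    # os.walk does not descend into symlinked directories by default.
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            file_path = Path(dirpath) / name
            if not file_path.is_symlink():
                total += file_path.stat().st_size
    return total


def subdir_sizes(cache_root: Path) -> dict:
    """Map each immediate subdirectory of cache_root to its total size in bytes."""
    return {
        entry.name: dir_size_bytes(entry)
        for entry in sorted(cache_root.iterdir())
        if entry.is_dir()
    }


if __name__ == "__main__":
    # Assumed location; adjust to wherever the datasets cache actually lives.
    cache_root = Path.home() / ".cache" / "huggingface" / "datasets"
    if cache_root.is_dir():
        for name, size in subdir_sizes(cache_root).items():
            print(f"{name}: {size / 1e9:.2f} GB")
```

Running this would show whether the downloads directory is really what takes up most of the space, compared with the processed dataset directories next to it.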
I would appreciate any help on the topic.