datasets
uses shutil.disk_usage()
to know if there is enough space before writing a (potentially huge) dataset
(maybe if it says zero it can let the writing begin - it should fail anyway if it’s really zero under the hood ? that part of datasets
is open to contributions btw if you want to improve it)