I’m using Dataset.from_generator to build large datasets.
Assuming the builder writes incrementally to disk as the dataset is constructed, is there a way to automatically resume in case of an error that disrupts construction?