How to create a new large Dataset on disk?

The plan is mostly to ask for a generator function. It’s probably the easiest way to ask for possibly large data without filling up your RAM. TensorFlow Datasets already implements from_generator this way.

Then from_list is also a good one for in memory data.