The plan is mostly to ask for a generator function. It’s probably the easiest way to ask for possibly large data without filling up your RAM. TensorFlow Datasets already implements from_generator
this way.
Then from_list
is also a good one for in memory data.