Anyway, I think that using from_generator or from_list will use up too much RAM. If you want to create a very large dataset, you can define a script to load the dataset, but if you can limit the dataset to images, you might be able to use the following method.
In short, you can create a dataset just by uploading it to HF in separate directories.