I think it’s better to download the data first, create a dataset that includes the actual images, and then upload it, as this reduces the risk of encountering download errors during training. However, either method should work. Additionally, there is an option to create a script for loading the dataset.
Ultimately, you need to decide whether to have the Trainer’s DataCollator download the data from the URL or to prepare the dataset in advance and use the datasets library.