I want to train a diffusion model the same way it was done in instruct_pix2pix. I have already created triplets of input_image, text_instruction, and output_image; however, I don’t understand how to link an input image to its output image in metadata.jsonl. What should the directory layout be, and which fields should the metadata file contain? Currently, all my input and output images are stored in a single directory: generated_data/triplet_dataset/train.
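To make the question concrete, here is a minimal sketch of one way the pairing could be written down, with one JSON line per triplet. Everything specific in it is an assumption: the file naming convention (`<id>_input.png` / `<id>_output.png`) is hypothetical, the `instructions` mapping is a stand-in for wherever the text instructions actually live, and the field names `input_image`, `edit_prompt`, and `edited_image` are chosen to mirror the column names I believe the instruct_pix2pix example data uses. As far as I know, the `imagefolder` loader in `datasets` looks for a `file_name` field, so this file is meant as a plain pairing index rather than something that loader would pick up automatically.

```python
import json
from pathlib import Path


def build_metadata(train_dir, instructions):
    """Write metadata.jsonl with one record per complete triplet.

    Assumes a hypothetical naming scheme: "<id>_input.png" and "<id>_output.png".
    `instructions` maps each sample id to its text instruction.
    """
    train_dir = Path(train_dir)
    records = []
    for input_path in sorted(train_dir.glob("*_input.png")):
        sample_id = input_path.name[: -len("_input.png")]
        output_path = train_dir / f"{sample_id}_output.png"
        # Skip incomplete triplets instead of writing a dangling reference.
        if not output_path.exists() or sample_id not in instructions:
            continue
        records.append(
            {
                "input_image": input_path.name,
                "edit_prompt": instructions[sample_id],
                "edited_image": output_path.name,
            }
        )
    # One JSON object per line, paths relative to the directory itself.
    with open(train_dir / "metadata.jsonl", "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return records
```

Each line of the resulting file then ties an input image to its output image through the shared sample id, which is the link I am unsure how to express.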
Thanks for the suggestion! Could you then steer me in the right direction for publishing the dataset to the Hub? I want to create a dataset with exactly the same structure as the one here. Any leads are appreciated.
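For reference, this is roughly what I imagine the publishing step looks like, as a sketch under assumptions rather than a confirmed recipe. It assumes a metadata.jsonl in which every line has `input_image`, `edit_prompt`, and `edited_image` fields holding file names relative to the metadata file (those field names are my guess at the target structure), and the repo id passed to `publish` is a placeholder. The calls `Dataset.from_dict`, `cast_column`, `Image`, and `push_to_hub` come from the `datasets` library.

```python
import json
from pathlib import Path


def load_columns(metadata_path):
    """Read metadata.jsonl into column lists, resolving image paths."""
    metadata_path = Path(metadata_path)
    root = metadata_path.parent
    columns = {"input_image": [], "edit_prompt": [], "edited_image": []}
    with open(metadata_path, encoding="utf-8") as f:
        for line in f:
            if not line.strip():
                continue  # tolerate blank lines
            record = json.loads(line)
            columns["input_image"].append(str(root / record["input_image"]))
            columns["edit_prompt"].append(record["edit_prompt"])
            columns["edited_image"].append(str(root / record["edited_image"]))
    return columns


def publish(columns, repo_id):
    """Build a dataset with two image columns and push it to the Hub."""
    # Imported lazily so load_columns works without `datasets` installed.
    from datasets import Dataset, Image

    dataset = Dataset.from_dict(columns)
    # Casting the path strings to Image() makes them decode as images.
    dataset = dataset.cast_column("input_image", Image())
    dataset = dataset.cast_column("edited_image", Image())
    # Requires being authenticated, e.g. via `huggingface-cli login`.
    dataset.push_to_hub(repo_id)
    return dataset
```

Usage would be something like `publish(load_columns("generated_data/triplet_dataset/train/metadata.jsonl"), "your-username/your-dataset")`, where the repo id is hypothetical.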