I want to train a diffusion model the same way it was done in instruct_pix2pix. I have already created triplets of input_image, text_instruction, and output_image; however, I don’t understand how to link an input image to its output image in metadata.jsonl. What should the directory layout or the fields of the metadata file be? Currently, all my input and output images are stored in one directory: generated_data/triplet_dataset/train.
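One possible way to link the images, as a minimal sketch rather than a required layout: write one JSON object per line, where each line names the input image, the instruction, and the output image of a single triplet. The field names `input_image`, `edit_prompt`, and `edited_image` are assumptions here (the training script lets you set column names through its arguments, so use whatever you pass there), and the file names and the `write_metadata` helper are hypothetical.

```python
import json
from pathlib import Path


def write_metadata(triplets, out_path):
    """Write one JSON object per line, linking each input image to its output.

    `triplets` is an iterable of (input_file, instruction, output_file).
    The field names below are assumptions; rename them to match the
    column names you pass to the training script.
    """
    out_path = Path(out_path)
    with out_path.open("w", encoding="utf-8") as f:
        for input_file, instruction, output_file in triplets:
            record = {
                "input_image": input_file,    # file name relative to the image directory
                "edit_prompt": instruction,   # the text instruction
                "edited_image": output_file,  # file name of the edited result
            }
            f.write(json.dumps(record) + "\n")
    return out_path


# Hypothetical file names, all living in the same train directory.
triplets = [
    ("0001_input.png", "make it snowy", "0001_output.png"),
    ("0002_input.png", "turn the car red", "0002_output.png"),
]
```

Calling `write_metadata(triplets, "generated_data/triplet_dataset/train/metadata.jsonl")` would put the file next to the images. When such a file is loaded with the `json` loader, the image fields are plain strings, so the images still have to be opened from those paths during preprocessing.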
Thanks for the suggestion! Could you then steer me in the right direction for publishing the dataset to the Hub? I want to create a dataset with exactly the same structure as the one here. Any leads are appreciated.
to dataset = load_dataset('json', data_files='extracted_dataset/train/metadata.json')
Also, in preprocess_images I had to prepend the directory to the image file names: [convert_to_np(Image.open('extracted_dataset/train/'+image).convert('RGB'), args.resolution) for image in examples[original_image_column]]
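The path handling above can also be pulled into a small helper so the directory is not hard-coded inside the list comprehension. This is only a sketch: `resolve_image_paths` and its `image_root` parameter are hypothetical names, not part of the training script.

```python
from pathlib import Path


def resolve_image_paths(file_names, image_root):
    """Join each file name from the metadata with the image directory."""
    root = Path(image_root)
    return [str(root / name) for name in file_names]


# Hypothetical usage with the directory from the post above.
paths = resolve_image_paths(["0001_input.png", "0002_input.png"], "extracted_dataset/train")
```

Each returned path can then be passed to `Image.open(...)` in place of the string concatenation.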
If you need to train from the pretrained timbrooks/instruct-pix2pix, comment out the code that adds the additional channels (line 514):