Input Data Format for Inpainting with Changed in_channels in diffusers

Dadadaren · November 9, 2023, 3:32pm

I’m planning to train the stable-diffusion-2-inpainting model using my own dataset consisting of image.jpg and mask.jpg pairs. I understand that the input should have 9 channels. Could you provide guidance on how to preprocess these image and mask files and structure them correctly for training? Specifically:

What is the expected tensor structure for the 9-channel input?
How should I combine my image and mask files to conform to this structure?
Are there any specific preprocessing steps or code examples available?

Any guidance or example code would be very helpful.

Topic		Replies	Views
StableDiffusionInpaintPipeline Tensor Input Error 🧨 Diffusers	2	1441	November 9, 2022
Multi_controlnet + inpaint 🧨 Diffusers	5	3589	November 12, 2023
Custom pipeline for image inpainting Beginners	1	77	October 27, 2024
[Stable Diffusion] Error in "In Painting" pipeline 🧨 Diffusers	5	1816	June 29, 2023
Inverting images/encoding images into noise? 🧨 Diffusers	0	462	September 6, 2022

Input Data Format for Inpainting with Changed in_channels in diffusers

Related topics