Why are Initial latents weighted by mask only with unet nchannels=4?

manurare · June 6, 2024, 3:43pm

Hi,

I am confused. Why in the StableDiffusionInpaintPipelineV2 a linear interpolation between denoised latents and initial latents weighted by the mask only happens when the number of channels of the unet is 4 and not if it’s 9? Here is the line

Wouldn’t it make sense to add noise only where mask==1 and leave the rest as the initial latent since in those regions we don’t need to generate any content?

Topic		Replies	Views
Why StableDiffusionInpaintPipeline do not provide the "strength" parameter? 🧨 Diffusers	1	1748	March 9, 2023
[Stable Diffusion] Error in "In Painting" pipeline 🧨 Diffusers	5	1816	June 29, 2023
Set latents in StableDiffusionInpaintPipeline to original image 🧨 Diffusers	1	619	May 17, 2024
Debugging Custom Stable Diffusion Pipeline for 1D Signal Generation 🧨 Diffusers	1	14	July 2, 2025
Starting Stable Diffusion In The Middle Models	0	242	June 13, 2023

Why are Initial latents weighted by mask only with unet nchannels=4?

Related topics