Why the output of the UNet is noise?

nongnongzi November 14, 2023, 9:08am 1

In the stable diffusion model or other diffusion model, since the input of the UNet is (noisy_image, text_embedding, time_embedding), why the output of it is the noise, but not an denoised image?

In the traditional UNet for image

Topic		Replies	Views
A few questions about how (vanilla) diffusion works Beginners	1	893	September 25, 2022
Why I get this UNet generated latents result been so messy? Beginners	3	83	May 31, 2024
Why is the loss of Diffusion model calculated between "RANDOM noise" and "model predicted noise"? Not between "Actual added noise" and "model predicted noise"? 🧨 Diffusers	12	5548	November 27, 2023
Why using ground-truth noise in a diffusion model does not work? Beginners	0	359	August 1, 2023
Unet1dmodel for latent image diffusion 🧨 Diffusers	0	392	April 11, 2023

Why the output of the UNet is noise?

Related topics