Could there be a "remove noise" function to remove noise from noisy_latents, given the noise and the timestep?

When using “epsilon” prediction mode, I notice that the model is trained to predict a pre-defined noise sample drawn from a normal distribution. Since both the noise and the noisy_latents are too ambiguous for any semantic loss (like measuring the CLIP feature distance between the generated image and the text prompt) during training, I wonder if there’s a way to directly subtract the noise from the noisy_latents (instead of denoising step by step), ending up with a clean latent without noise?
I guess this is feasible because noisy_latents looks like a weighted sum of latents and noise inside the scheduler.add_noise function, but my math isn’t good enough to invert this process. Could some kind-hearted soul help me? :slight_smile:
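For what it’s worth, here is the kind of inversion I have in mind — a minimal sketch, assuming a DDPM-style scheduler whose add_noise computes noisy = sqrt(ᾱ_t) · latents + sqrt(1 − ᾱ_t) · noise (with ᾱ_t = alphas_cumprod[t]). The function name remove_noise is hypothetical, not part of any library:

```python
import torch

def remove_noise(noisy_latents, noise, timesteps, alphas_cumprod):
    # Hypothetical inverse of a DDPM-style add_noise, which (under the
    # assumption above) computes:
    #   noisy = sqrt(a_t) * latents + sqrt(1 - a_t) * noise
    # Solving for latents gives:
    #   latents = (noisy - sqrt(1 - a_t) * noise) / sqrt(a_t)
    a_t = alphas_cumprod[timesteps].flatten()
    # Broadcast a_t from shape (batch,) to (batch, 1, 1, ...) so it
    # multiplies per-sample across the latent dimensions.
    while a_t.dim() < noisy_latents.dim():
        a_t = a_t.unsqueeze(-1)
    return (noisy_latents - (1 - a_t).sqrt() * noise) / a_t.sqrt()
```

If the scheduler really does use this closed form, this recovers the original latents in one step given the *true* noise; during training one would feed in the model’s *predicted* epsilon instead, so the result is only the model’s one-shot estimate of the clean latent, not a proper sample.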