For the purpose of visualizing work in progress on the image inference pipelines, I’d like to convert the latents to a display format at each step along the way. But I find that if I run them through the pipeline’s VAE decoder, that takes far too much time that could be better spent on the pipeline’…

After some empirical tests, I have determined that I can get a useful approximation of the RGB output using a linear combination of the latent channels. [sd-latent-channels-linear-approximation] This approximation comes from multiplying the four latent channels by these factors: v1_4_rgb_latent_f…

Decoding latents to RGB without upscaling

🧨 Diffusers

pcuenq September 22, 2022, 11:16am 4

That’s pretty amazing, do those numbers work for all types of images? I wonder why your initial experiment to use a smaller decoder wouldn’t work, it sounds like a reasonable idea to me!

Topic		Replies	Views
How to get intermeidate output images 🧨 Diffusers	4	3253	March 18, 2025
AutoencoderKL.scaling_factor and VaeImageProcessor 🧨 Diffusers	6	4128	August 29, 2023
Set latents in StableDiffusionInpaintPipeline to original image 🧨 Diffusers	1	621	May 17, 2024
Why are Initial latents weighted by mask only with unet nchannels=4? Research	0	118	June 6, 2024
Unet1dmodel for latent image diffusion 🧨 Diffusers	0	383	April 11, 2023

Decoding latents to RGB without upscaling

Related topics