Stable diffusion inpainting 1.5 uses KL autoencoder however paper reports best metric with VQ-VAE

manurare · March 31, 2023, 4:49pm

Hello,

I am wondering if anybody knows why the Stable Diffusion paper reports its best results (Supplemental material, Section D, Table 8) with a VQ-VAE with a codebook size of 8192 and a dimension of 3, whereas the released 1.5 weights use a KL autoencoder. Does anybody know the reason for this change?
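
To make the comparison concrete, here is a minimal NumPy sketch of how the two kinds of latent bottleneck differ. It is only an illustration, not the code from the paper or from the released checkpoint: the function names `vq_bottleneck` and `kl_bottleneck` are hypothetical, the inputs are random, and only the codebook size (8192) and dimension (3) come from the table mentioned above.

```python
import numpy as np


def vq_bottleneck(z, codebook):
    """VQ-style bottleneck: snap each latent vector to its nearest codebook entry."""
    # z: (n, d), codebook: (k, d) -> squared L2 distances of shape (n, k)
    dist2 = ((z[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    idx = dist2.argmin(axis=1)
    return codebook[idx], idx


def kl_bottleneck(mean, logvar, rng):
    """KL-style bottleneck: sample a continuous latent and compute KL to N(0, I)."""
    std = np.exp(0.5 * logvar)
    z = mean + std * rng.standard_normal(mean.shape)  # reparameterization trick
    kl = 0.5 * np.sum(mean ** 2 + np.exp(logvar) - 1.0 - logvar, axis=-1)
    return z, kl


rng = np.random.default_rng(0)
codebook = rng.standard_normal((8192, 3))  # size and dimension quoted in the post
z_e = rng.standard_normal((16, 3))         # stand-in for encoder outputs (assumption)

z_q, idx = vq_bottleneck(z_e, codebook)                 # discrete: one index per vector
z_kl, kl = kl_bottleneck(z_e, np.zeros_like(z_e), rng)  # continuous, KL-regularized
```

In the VQ case every latent is one of 8192 fixed vectors, while in the KL case the latent stays continuous and is only softly pulled towards a standard normal.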

Thanks!

ZongzeWu · December 28, 2023, 9:41am

I have the same question.
