I am trying to use VQModel to create a vector-quantized autoencoder, but the spatial size of the latent space is still the same as the size of the input image. I can reduce the number of channels with the `latent_channels` parameter, but how do I change the spatial dimensions of the encoded space? Say the input is 512 x 512 and I want the encoded space to be 64 x 64, as described in the paper.
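
For reference, here is a minimal sketch of what I am doing (the parameter values are illustrative, not my exact config), which shows the behavior I mean: `latent_channels` changes the channel depth of the encoding, but the spatial dimensions stay at 512 x 512.

```python
import torch
from diffusers import VQModel

# Illustrative setup: latent_channels=4 is an example value.
# With the default single down block, no spatial downsampling happens.
model = VQModel(
    in_channels=3,
    out_channels=3,
    latent_channels=4,  # reduces the channel dimension only
)

x = torch.randn(1, 3, 512, 512)
latents = model.encode(x).latents
print(latents.shape)  # torch.Size([1, 4, 512, 512]) -- spatial size unchanged
```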