Loss doesn't converge for latent diffusion model

NavneetSajwan · September 25, 2023, 9:34am

I have been trying to train a latent diffusion model, but the loss doesn’t seem to converge. I took inspiration from this example and modified it to input latents instead of images.

Link to my kaggle notebook: Latent diffusion for Monets[Multi-gpu, high res] | Kaggle
Things I’ve tried so far:

Model architecture: Since the latents fed into the UNet are much smaller than the images the UNet was built for, the feature maps in the middle layers might become very small, and this might hamper the model’s ability to learn. So I removed some layers, but that didn’t work. In fact, the average loss increased.
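To make the feature-map concern concrete, here is a rough sketch of how the spatial size shrinks through the UNet's down blocks. The numbers are assumptions, not values taken from the notebook: a 256x256 image, a VAE that downsamples by a factor of 8 (giving 32x32 latents), and a UNet with 6 down blocks where every block except the last halves the resolution. The helper `feature_map_sizes` is hypothetical and only does the arithmetic.

```python
def feature_map_sizes(input_size, num_down_blocks):
    """Spatial size of the feature map at each down block.

    Assumes every block except the last halves the resolution,
    and that input_size is a power of two (hypothetical setup).
    """
    sizes = [input_size]
    for _ in range(num_down_blocks - 1):
        sizes.append(max(sizes[-1] // 2, 1))
    return sizes


# Assumed values, not taken from the notebook.
image_size = 256
vae_scale_factor = 8
latent_size = image_size // vae_scale_factor

for blocks in (6, 4):
    print(blocks, "down blocks:", feature_map_sizes(latent_size, blocks))
```

Under these assumptions, 6 down blocks take a 32x32 latent all the way down to 1x1 at the bottleneck, while 4 down blocks stop at 4x4. That is the reasoning behind removing layers, although in my runs it did not improve the loss.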
