Partial modification of the "instruct-pix2pix" model

Montemusso · September 2, 2024, 3:03pm

Hello everyone, I am a master’s student working on my final thesis. I’ve been asked to run tests that modify the text attention function used by the UNet in instruct-pix2pix, specifically by swapping in the SwiGLU and ReGLU activation functions. Unfortunately, I’m not sure how to proceed.

Steps I’ve tried:

- Replacing the UNet (I encountered compatibility errors despite using an object of the correct type)
- Using the network available on GitHub (unfortunately, without optimizations, I don’t have the necessary resources)

Thank you very much for your help.

P.S. I am open to any suggestions. Unfortunately, I’ve been left to figure things out on my own, as this is one of the first projects of this kind for my professors, who specialize in NLP tasks.
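
For readers with the same question, here is a minimal sketch of one possible approach: instead of replacing the whole UNet, swap the gated activation inside the existing one. It assumes that the UNet's transformer blocks contain feed-forward modules of a class named `GEGLU` that hold a single `proj` linear layer whose output is split into a value half and a gate half, which appears to be the case in recent diffusers releases (please check your installed version). `GatedProjection` and `swap_gated_activation` are hypothetical names introduced for this example, not part of any library. SwiGLU gates with SiLU and ReGLU gates with ReLU.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class GatedProjection(nn.Module):
    """GLU-style layer: value * act(gate). Hypothetical helper, not a diffusers class."""

    def __init__(self, dim_in, dim_out, act, bias=True):
        super().__init__()
        # One linear layer produces both halves (value and gate).
        self.proj = nn.Linear(dim_in, dim_out * 2, bias=bias)
        self.act = act

    def forward(self, hidden_states, *args, **kwargs):
        # Extra arguments are accepted and ignored, in case the caller passes any.
        value, gate = self.proj(hidden_states).chunk(2, dim=-1)
        return value * self.act(gate)


def swap_gated_activation(model, act, target_class_name="GEGLU"):
    """Replace every child module whose class name matches, reusing its weights.

    Assumes each matched module has a `proj` attribute that is an nn.Linear.
    Returns the number of modules replaced.
    """
    swapped = 0
    # Snapshot the module lists so the tree can be edited while looping.
    for parent in list(model.modules()):
        for name, child in list(parent.named_children()):
            if type(child).__name__ != target_class_name:
                continue
            old = child.proj
            new = GatedProjection(
                old.in_features,
                old.out_features // 2,
                act,
                bias=old.bias is not None,
            ).to(device=old.weight.device, dtype=old.weight.dtype)
            new.proj.load_state_dict(old.state_dict())  # keep pretrained weights
            setattr(parent, name, new)
            swapped += 1
    return swapped


# Hypothetical usage with diffusers (not run here):
# from diffusers import StableDiffusionInstructPix2PixPipeline
# pipe = StableDiffusionInstructPix2PixPipeline.from_pretrained("timbrooks/instruct-pix2pix")
# swap_gated_activation(pipe.unet, F.silu)   # SwiGLU
# swap_gated_activation(pipe.unet, F.relu)   # ReGLU (on a freshly loaded UNet)
```

Because the pretrained weights were learned with a GELU gate, the outputs will change after the swap, so some fine-tuning is likely needed before the results are meaningful. Editing the modules in place keeps the rest of the pipeline untouched, which may avoid the compatibility errors that come with substituting a different UNet object.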
