How to have one pipeline to perform text2img, img2img with shared Stable Diffusion model?

batrlatom · September 8, 2022, 4:02pm

Hi,
I would like to deploy SD on the server and accomplish text2img and img2img tasks.
If the image is present, then it would perform img2img, if not, it would do the text2img.
What I would like to do is to have only one model in the memory, so it will not consume
more than is really needed. Is it possible with some easy hack?

EasyDiffusion · September 8, 2022, 6:29pm

Pipe could use a more global manager, with caching to dramatically reduce footprint when not in use after setup, or after a run. But what you should do in the meantime if you’re switching a pipe, is just del pipe and flush torch memory and do a garbage cleanup. Then you can load up the img2img pipe if a init is available, keeping only one in memory.

Another option is to, also do del pipe method, but to employ joblib and dump the pipe to disk, and load it on demand. So you could load up all your pipes, and cache them to disk, and load them just before diffusion, then either dump again (in case of param changes) or just del and load from the same cache before another run. Switch to a img2img cache when init is detected.

The pipe will take same amount of time to set up initially before it’s dumped, but when it loads a pipe, it will take under a second and be ready to feed prompts, etc, and start a run.

dkackman · September 16, 2022, 1:58am

A version of what’s described above in case it helps anyone:

github.com

dkackman/fing/blob/main/src/generator/pipelines.py

import torch
import logging
from diffusers import StableDiffusionPipeline, StableDiffusionImg2ImgPipeline, StableDiffusionInpaintPipeline
import pickle


def preload_pipelines(model_name, auth_token, pipeline_names = ["txt2img", "img2img", "imginpaint"]):
    # this will preload all the pipelines and serialize them to disk
    # get_pipeline will then retrive from disk, accomplishing two things:
    # 1 - pay the startup cost to get the model form hugging face only 1 time per process
    # 2 - keep them out of RAM (main and GPU) until actually needed
    # on demand they get deserialized and pushed to the gpu
    #
    # TODO #8 model the GPU as a class; including what pipeline is loaded and if it has a workload or not
    if "txt2img" in pipeline_names:
        logging.debug("Loading txt2img")
        pipeline = StableDiffusionPipeline.from_pretrained(
            model_name,
            revision="fp16",
            torch_dtype=torch.float16,

This file has been truncated. show original

pcuenq · September 23, 2022, 12:13pm

You may also find a related GitHub discussion interesting, in particular the code snippet in this comment.

Topic		Replies	Views
Access CLIP from StableDiffusionPipeline and use the same models for multiple pipelines 🧨 Diffusers	3	2606	October 11, 2023
Generating and saving multiple images using img2img pipeline 🧨 Diffusers	4	13100	February 8, 2023
Img2Img keeps devolving into psychedelics Beginners	0	540	September 28, 2022
SDXL Image to Image, howto 🧨 Diffusers	8	11171	June 1, 2025
How to optimize inference of stable diffusion model when the images generated are of different seed but with same prompt? 🧨 Diffusers	2	1432	February 7, 2024

How to have one pipeline to perform text2img, img2img with shared Stable Diffusion model?

Related topics