Generating an image is really slow

jthulhu · August 8, 2024, 4:43pm

Some context

I’m trying to generate an image from a textual prompt, and compare the various options available for this task, because I am very new to this, so I don’t know which one is most fit to my use case. I’d like to avoid having to train anything myself, and rather resort to a off-the-shelf solution. If I understood correctly, this is roughly what pretrained models available on the hugging face hub are for. I tried downloading one (chosen more or less at random, I picked alvdansen/phantasma-anime · Hugging Face), but I didn’t manage to actually load it and run it with a prompt to get an image out of it (all the from_pretrained class methods I’ve tried want a JSON config file, not a safetensors file). So, I started with a simpler task, to simply run on my computer the example given in the documentation of text-to-image diffusers:

import torch
from diffusers import StableDiffusionPipeline
pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16)
image = pipe("an image of an astronaut riding a horse on Mars").images[0]

The problem

I’m doing this on a laptop that doesn’t have a (dedicated) GPU, so I knew that I wouldn’t have the results right away, but I expected a single iteration to be performed at most in a couple of minutes. Yet, after ten minutes, a single iterations hasn’t finished.

I was wondering if it was normal that this task is that long, considering that it defaults to 50 iterations, so that generating a single image at this rate would take at least 9 hours.

Specs

For information, my CPU’s model is i7-1260P. The environment, as spit out by transformers-cli env, is

transformers version: 4.43.3
Platform: Linux-6.6.43-x86_64-with-glibc2.39
Python version: 3.11.9
Huggingface_hub version: 0.24.2
Safetensors version: 0.4.3
Accelerate version: 0.32.0
Accelerate config: not found
PyTorch version (GPU?): 2.3.1 (False)
Tensorflow version (GPU?): not installed (NA)
Flax version (CPU?/GPU?/TPU?): not installed (NA)
Jax version: not installed
JaxLib version: not installed

Also, during the generation of the image, I noted that it mostly uses only two cores, although it occasionally uses all the 16 cores. I was wondering whether there was a way to make it use more aggressively the resources of my machine.

Topic		Replies	Views
How to use fine tuned a pre-trained text to image model? 🧨 Diffusers	0	41	August 22, 2024
Beginner questions for the diffuers library, especially what model is fast and tiny Beginners	0	272	November 8, 2023
Why is the diffusers library slow? 🧨 Diffusers	0	781	March 15, 2023
How to optimize inference of stable diffusion model when the images generated are of different seed but with same prompt? 🧨 Diffusers	2	1430	February 7, 2024
Creation of Images from Text-Prompt Beginners	1	80	December 31, 2024

Generating an image is really slow

Some context

The problem

Specs

Related topics