Summary
I am facing an issue with `StableDiffusionImg2ImgPipeline`: it keeps throwing a `ValueError` stating that the input image format is incorrect, even though I am passing a `PIL.Image.Image` object as required.
Error Message
```
ValueError: Input is in incorrect format. Currently, we only support <class 'PIL.Image.Image'>, <class 'numpy.ndarray'>, <class 'torch.Tensor'>
```
Steps Taken
I have tried the following solutions, but the error persists:
- **Checked input format**
  - Used `print(type(init_image_reloaded))` and confirmed that it is `<class 'PIL.Image.Image'>` (a combined sanity-check cell for the first three checks is shown after this list).
- **Ensured image size is correct**
  - Image size is `(512, 512)`, which is a multiple of 8.
- **Updated dependencies**
  - Ran:
    ```
    !pip install --upgrade diffusers transformers accelerate ftfy
    !pip install --upgrade pydantic
    ```
  - My `diffusers` version is the latest available.
- **Tried re-saving the image to ensure correct format**
  - Saved and reloaded as PNG:
    ```python
    init_image.save("temp_image.png", format="PNG")
    init_image_reloaded = Image.open("temp_image.png").convert("RGB")
    ```
- **Checked whether `StableDiffusionImg2ImgPipeline` works with NumPy arrays**
  - Tried passing `np.array(init_image_reloaded)` instead of a `PIL.Image.Image`, but the same error occurs.
- **Checked Python version and execution environment**
  - Using Python 3.11
  - Running on Google Colab
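For completeness, here are the first three checks condensed into a single Colab cell; the values in the comments are what I see in my session:

```python
from PIL import Image
import diffusers, torch, numpy as np

# Check 1: the input really is a PIL image in RGB mode
img = Image.open("/content/food_nighit_ramen.jpeg").convert("RGB")
print(type(img))   # <class 'PIL.Image.Image'>
print(img.mode)    # RGB

# Check 2: both dimensions are multiples of 8
print(img.size)    # (512, 512)
assert img.size[0] % 8 == 0 and img.size[1] % 8 == 0

# Check 3: library versions after the pip upgrade
print("diffusers:", diffusers.__version__)
print("torch:", torch.__version__)
print("numpy:", np.__version__)
```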
Code Example
```python
import torch
from diffusers import StableDiffusionImg2ImgPipeline
from PIL import Image
import numpy as np

# --- Load input image ---
init_image = Image.open("/content/food_nighit_ramen.jpeg").convert("RGB")
print("Original image type:", type(init_image), "size:", init_image.size, "mode:", init_image.mode)

# --- Re-save and reload image ---
temp_image_path = "temp_image.png"
init_image.save(temp_image_path, format="PNG")
init_image_reloaded = Image.open(temp_image_path).convert("RGB")
print("Reloaded image type:", type(init_image_reloaded), "size:", init_image_reloaded.size, "mode:", init_image_reloaded.mode)

# --- Load model ---
model_id = "runwayml/stable-diffusion-v1-5"
pipe = StableDiffusionImg2ImgPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe = pipe.to("cuda")

# --- Set parameters ---
prompt = "A majestic fantasy creature evolving with vibrant flames and sparkling water effects, ultra-detailed, epic fantasy art"
num_inference_steps = 50
guidance_scale = 7.5
strength = 0.7

# --- Run image-to-image (PIL.Image is passed) ---
result = pipe(
    prompt=prompt,
    init_image=init_image_reloaded,  # PIL.Image should be accepted here
    strength=strength,
    guidance_scale=guidance_scale,
    num_inference_steps=num_inference_steps,
)
generated_image = result.images[0]

# --- Save result ---
generated_image.save("generated_evolved_monster.png")
```
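One thing I noticed while reading the current `diffusers` documentation: the img2img examples there pass the input via an `image=` keyword rather than `init_image=`. I don't know whether this rename is related to my error (see question 3 below), but for reference this is the documented call pattern:

```python
# Call pattern from the current diffusers img2img docs:
# the input image keyword is `image`, not `init_image`.
result = pipe(
    prompt=prompt,
    image=init_image_reloaded,
    strength=strength,
    guidance_scale=guidance_scale,
    num_inference_steps=num_inference_steps,
)
```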
Questions
1. Why is `StableDiffusionImg2ImgPipeline` rejecting `PIL.Image.Image` as input?
2. Are there additional preprocessing steps required before passing the image? (A manual preprocessing sketch I considered is shown after these questions.)
3. Has there been a recent breaking change in `diffusers` that affects the `init_image` input?
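Regarding question 2, this is the manual preprocessing I would try next, based on my understanding that the pipeline normalizes images to a `[-1, 1]` float tensor in BCHW layout; this is an assumption on my part, not something I've confirmed in the source:

```python
import numpy as np
import torch

# Assumed preprocessing: HWC uint8 in [0, 255] -> BCHW float16 in [-1, 1].
# I have NOT confirmed the pipeline expects exactly this; it is my reading
# of how diffusers normalizes input images.
arr = np.array(init_image_reloaded).astype(np.float32) / 255.0  # (512, 512, 3)
tensor = torch.from_numpy(arr).permute(2, 0, 1).unsqueeze(0)    # (1, 3, 512, 512)
tensor = tensor * 2.0 - 1.0
tensor = tensor.to("cuda", dtype=torch.float16)
```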
Any help would be greatly appreciated. Thank you!