Invalid image format

ammarfunprime · October 15, 2024, 10:40am

I’m encountering a persistent issue when running the StableDiffusionInpaintPipeline for an inpainting task. Despite passing inputs in the expected formats (both the image and mask are in PIL.Image.Image format with correct sizes), I keep receiving the following error:

ValueError: Input is in incorrect format. Currently, we only support <class 'PIL.Image.Image'>, <class 'numpy.ndarray'>, <class 'torch.Tensor'>

Here’s the code that triggers the error:

# Image and mask setup
image_pil = Image.fromarray(image_np)
mask_pil = Image.fromarray(black_mask).convert("L")

# Generator for reproducibility
generator = torch.Generator(device="cuda").manual_seed(0)

image = model["pipeline"](
    prompt=prompt,
    negative_prompt=IMG_INPAINTING_NEG_PROMPT,
    image=image_pil,  # PIL Image
    mask=mask_pil,    # Grayscale mask (mode "L")
    guidance_scale=8.0,
    num_inference_steps=50,
    generator=generator,
).images[0]

Image and Mask Details:

Image size: (512, 768), mode: RGB
Mask size: (512, 768), mode: L
The mask is binary (contains only 0 and 255 values).

I’ve also tried using a simple manually created mask to ensure that FastSAM-generated masks aren’t causing the issue, but I still get the same error.

John6666 · October 15, 2024, 10:59am

It looks like you are stuck here, but I think this is a bug in Diffusers…?

github.com

huggingface/diffusers/blob/main/src/diffusers/image_processor.py

# Copyright 2024 The HuggingFace Team. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

import math
import warnings
from typing import List, Optional, Tuple, Union

import numpy as np
import PIL.Image

This file has been truncated. show original

def is_valid_image(image):
    return isinstance(image, PIL.Image.Image) or isinstance(image, (np.ndarray, torch.Tensor)) and image.ndim in (2, 3)

Maybe this is correct.

def is_valid_image(image):
    return isinstance(image, PIL.Image.Image) or (isinstance(image, (np.ndarray, torch.Tensor)) and image.ndim in (2, 3))

ndim is not an element of PIL.Image.Image.

github.com

python-pillow/Pillow/blob/main/src/PIL/Image.py

#
# The Python Imaging Library.
# $Id$
#
# the Image class wrapper
#
# partial release history:
# 1995-09-09 fl   Created
# 1996-03-11 fl   PIL release 0.0 (proof of concept)
# 1996-04-30 fl   PIL release 0.1b1
# 1999-07-28 fl   PIL release 1.0 final
# 2000-06-07 fl   PIL release 1.1
# 2000-10-20 fl   PIL release 1.1.1
# 2001-05-07 fl   PIL release 1.1.2
# 2002-03-15 fl   PIL release 1.1.3
# 2003-05-10 fl   PIL release 1.1.4
# 2005-03-28 fl   PIL release 1.1.5
# 2006-12-02 fl   PIL release 1.1.6
# 2009-11-15 fl   PIL release 1.1.7
#

This file has been truncated. show original

Currently, it should be possible to slip through this check by passing it in numpy format.

@sayakpaul I found a crappy bug in Diffusers.

John6666 · October 29, 2024, 11:48am

I opened PR on github.

github.com/huggingface/diffusers

Update image_processor.py

huggingface:main ← John6666cat:patch-1

opened 11:46AM - 29 Oct 24 UTC

John6666cat

+1 -1

`PIL.Image.Image` doesn't have `dim`. See: https://discuss.huggingface.co/t/…invalid-image-format/112175 # What does this PR do?  Fixes # (issue) ## Before submitting - [x] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [x] Did you read the [contributor guideline](https://github.com/huggingface/diffusers/blob/main/CONTRIBUTING.md)? - [x] Did you read our [philosophy doc](https://github.com/huggingface/diffusers/blob/main/PHILOSOPHY.md) (important for complex PRs)? - [ ] Was this discussed/approved via a GitHub issue or the [forum](https://discuss.huggingface.co/c/discussion-related-to-httpsgithubcomhuggingfacediffusers/63)? Please add a link to it if that's the case. - [ ] Did you make sure to update the documentation with your changes? Here are the [documentation guidelines](https://github.com/huggingface/diffusers/tree/main/docs), and [here are tips on formatting docstrings](https://github.com/huggingface/diffusers/tree/main/docs#writing-source-documentation). - [ ] Did you write any new necessary tests? ## Who can review? Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

Topic		Replies	Views
StableDiffusionImg2ImgPipeline: ValueError - Input is in incorrect format despite correct PIL image input 🧨 Diffusers	4	205	February 3, 2025
StableDiffusionInpaintPipeline Tensor Input Error 🧨 Diffusers	2	1441	November 9, 2022
ValueError: Invalid image type. Expected either PIL.Image.Image, numpy.ndarray, torch.Tensor, tf.Tensor or jax.ndarray, but got 🤗Transformers	6	4221	January 5, 2024
StableDiffusionInpaintPipeline 'NoneType' is not iterable 🧨 Diffusers	1	116	April 24, 2025
Help with image inpainting Beginners	3	32	April 1, 2025

Invalid image format

Related topics