Vision Transfomer issue with broadcasting shapes

anon76594689 · November 22, 2022, 10:00pm

Hello,

I am using image transformers from HF but although my images are RGB, I keep getting the following error:

operands could not be broadcast together with shapes (224,224) (3,) (224,224) .

Any idea on how I can dynamically fix this? Should I do this using collate?

nielsr · November 23, 2022, 8:00am

Hi,

Could you provide a code snippet to reproduce your error?

You’re sure that you did image.convert(“RGB”)?

anon76594689 · November 23, 2022, 2:12pm

I added the following portion:

data = load_dataset(“venetis/VMMRdb_make_model”)

def transforms(examples):
examples[“image”] = [image.convert(“RGB”) for image in examples[“image”]]

data = data.map(transforms,batched=True)

and it seems to produce the following error:
UnidentifiedImageError: cannot identify image file <_io.BytesIO object at 0x7f0b3a27dd70>

samayl24 · April 24, 2024, 7:44pm

Same problem.

nielsr · April 29, 2024, 10:02am

I usually get this error when the image is not an appropriate file.

Topic		Replies	Views
ValueError: operands could not be broadcast together with shapes (1,2048,51200) (20,2,1,16,2048,64) Beginners	1	715	April 4, 2023
ValueError: operands could not be broadcast together with shapes (1,2048,51200) (20,2,1,16,2048,64) Beginners	0	689	April 2, 2023
How to use Trainer with Vision Transformer Beginners	3	1690	October 19, 2021
ValueError: could not broadcast input array from shape (30,512,32128) into shape (30,512) 🤗Transformers	2	2449	February 13, 2024
Wrong tensor shape when using a model: TypeError: Cannot handle this data type: (1, 1, 1280, 3), \|u1 Beginners	3	1474	January 9, 2024