Object Detection with images of different sizes

amogkam · May 25, 2023, 1:04am

Trying to do Object detection on a batch of images with different sizes.

I have 10 images in my batch:

from transformers import pipeline
from PIL import Image

# If doing CPU inference, set device="cpu" instead.
obj_detector = pipeline("object-detection", model="facebook/detr-resnet-50", device="cuda:0", do_pad=False)
outputs = obj_detector([Image.fromarray(image_array) for image_array in batch], top_k=1, batch_size=10)

batch is a list of image numpy arrays with each image having a different height and width.

But run into this issue

RuntimeError: The expanded size of the tensor (1066) must match the existing size (800) at 
non-singleton dimension 1.  Target sizes: [1066, 1066].  Tensor sizes: [1066, 800]

Topic		Replies	Views
How to perform batch inference on GroundingDino model 🤗Transformers	2	674	July 25, 2024
`target_sizes` and `output.logits` do not align in `image_processor.post_process_object_detection` 🤗Transformers	0	52	September 3, 2024
Unable to create tensor, you should probably activate padding with 'padding=True' to have batched tensors with the same length Beginners	1	1354	November 6, 2024
Trainer API object detection 🤗Transformers	2	51	December 29, 2024
Potential bug in the rt-detr v2 fine tune script 🤗Transformers	5	306	July 29, 2025

Object Detection with images of different sizes

Related topics