Object detection resolution fine-tuning

John6666 · July 14, 2025, 7:19am

Is it (1) possible

As long as the processor works properly, there shouldn’t be any major problems. Some models, such as CLIP, seem to have hard-coded resolutions, but otherwise like this should be fine.

from transformers import DetrImageProcessor

image_processor = DetrImageProcessor.from_pretrained(
    "facebook/detr-resnet-50",
    do_resize=True,
    size={"height": 540, "width": 960},   # ← your 16:9 resolution
    default_to_square=False,
    do_pad=True,
    pad_size={"height": 540, "width": 960},
)

and is it a good idea (2)

There does not seem to be much of a negative impact on accuracy. However, since the existing weights are learned as squares, it may be necessary to perform thorough tuning on your own.

Topic		Replies	Views
When Fine-tuning a object detection model which parameters do we update? 🤗Transformers	1	19	July 10, 2025
Example DeTr Object Detectors not predicting after fine tuning Beginners	6	1397	May 9, 2024
RT-DETRV2 and normalization 🤗Transformers	0	10	July 10, 2025
How to fine tune DiT for object detection? 🤗Transformers	1	1751	November 27, 2023
Fine tuning image transformer on higher resolution Beginners	11	7885	May 1, 2024

Object detection resolution fine-tuning

Related topics