What bounding boxes format does Grounding DINO use?

John6666 · July 5, 2025, 9:16am

I think it will be converted internally if you pass it in this format.

The image_processor expects the annotations to be in the following format: {'image_id': int, 'annotations': list[Dict]}, where each dictionary is a COCO object annotation.

github.com/huggingface/transformers

src/transformers/image_utils.py

main


      
              """
              height, width = image_size
              height_scale = max_height / height
              width_scale = max_width / width
              min_scale = min(height_scale, width_scale)
              new_height = int(height * min_scale)
              new_width = int(width * min_scale)
              return new_height, new_width
          
          
          def is_valid_annotation_coco_detection(annotation: dict[str, Union[list, tuple]]) -> bool:
              if (
                  isinstance(annotation, dict)
                  and "image_id" in annotation
                  and "annotations" in annotation
                  and isinstance(annotation["annotations"], (list, tuple))
                  and (
                      # an image can have no annotations
                      len(annotation["annotations"]) == 0 or isinstance(annotation["annotations"][0], dict)
                  )
              ):

Topic		Replies	Views
GroundingDINO dataset format Beginners	1	36	December 30, 2024
Owl-Vit postprocess API bbox conversion Beginners	5	343	February 9, 2024
Load a COCO format database from disk for DETR 🤗Datasets	4	99	May 14, 2025
Documentation script for fine-tuning Mask2Former with Trainer does not support instance segmentation with superposed instances 🤗Transformers	3	62	March 2, 2025
Prepare dataset from YOLO format to COCO for DETR 🤗Transformers	4	5152	May 6, 2025

What bounding boxes format does Grounding DINO use?

Related topics