Hi. I would like to compare two nets using the same dataset, regardless being Transformer-based (DETR) vs Non-Transformer based (YOLOv5).
I have already trained a model using Yolov5, such that my dataset is already split into train-val-test, in YOLO format. See Formatting table to visualize an example. My dataset folder looks like this:
.
βββ train
βββ images
β βββ ima1.png
β βββ ima2.png
β βββ ...
βββ labels
β βββ ima1.txt
β βββ ima2.txt
β βββ ...
βββ val
βββ images
β βββ ima3.png
β βββ ima4.png
β βββ ...
βββ labels
β βββ ima3.txt
β βββ ima4.txt
β βββ ...
βββ test
βββ images
β βββ ima5.png
β βββ ima6.png
β βββ ...
βββ labels
β βββ ima5.txt
β βββ ima6.txt
β βββ ...
Now I want to convert it to COCO format. From Hugging Face documentation, DETR demands COCO format in labels, using JSON files. However, you are using a dataset loaded from Hugging Face datasets library. Moreover, I would like to know if I should create 3 JSON files, for each split, or 1 JSON file containing all. In the latter case, could you provide some documentation on how should the JSON file be defined?
If there is any tutorial on how to prepare the data to feed DETR, based on my specs, it would be nice to post it here.
Thank you for all!