Help making object detection dataset

Hmm… This seems difficult for me. @lhoestq