Why are only 2 of the RT-DETR v2 implemented losses actually used?

topagrume · April 16, 2025, 11:29am

Hi,

I’ve been studying the RTDetrLoss implementation in the transformers library and noticed something interesting. The class implements several loss functions:

loss_labels_vfl
loss_labels (cross-entropy)
loss_cardinality
loss_boxes
loss_masks
loss_labels_bce
loss_labels_focal

However, looking at the code, it seems only “vfl” and “boxes” losses are actually used in practice (these are the only ones included in self.losses = ["vfl", "boxes"]).

I’m curious about why the other loss functions are implemented but not utilized. Is this consistent with the RT-DETR paper’s implementation? Or perhaps were the other losses tested empirically but found to be less effective?

Has anyone experimented with enabling the other loss functions like the focal loss or BCE loss? I’d appreciate any insights about the design choices here.

Hope you guys can help! Thank you.

topagrume · May 2, 2025, 3:55pm

If anyone has an answer it would be very helpful.

John6666 · May 2, 2025, 8:10pm

I did some quick research at the time, but couldn’t find any useful information. It seems that those parameters already existed in RT-DETR (not v2), but I couldn’t figure out why they were implemented.

github.com/huggingface/transformers

New model support RTDETR

main ← SangbumChoi:rtdetr

opened 08:34AM - 17 Feb 24 UTC

SangbumChoi

+6892 -9

# What does this PR do? This is the new model for RTDETR that is complete ver…sion from https://github.com/huggingface/transformers/pull/27247 There are several TO DOs - [X] reslove conflicts - [X] weight files for other 7 RTDETR - [X] Edit testing script - [X] (optional) enable training ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case). - [X] Did you read the [contributor guideline](https://github.com/huggingface/transformers/blob/main/CONTRIBUTING.md#create-a-pull-request), Pull Request section? - [ ] Was this discussed/approved via a Github issue or the [forum](https://discuss.huggingface.co/)? Please add a link to it if that's the case. - [X] Did you make sure to update the documentation with your changes? Here are the [documentation guidelines](https://github.com/huggingface/transformers/tree/main/docs), and [here are tips on formatting docstrings](https://github.com/huggingface/transformers/tree/main/docs#writing-source-documentation). - [X] Did you write any new necessary tests? ## Who can review? Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR. @amyeroberts @NielsRogge

topagrume · May 5, 2025, 9:15am

ok thank you

Topic		Replies	Views
How was self.loss_function implemented 🤗Transformers	4	30	June 9, 2025
Why is BCELoss used for multi-label classification? 🤗Transformers	4	373	October 12, 2024
Transformers replacing loss function 🤗Transformers	0	3370	March 26, 2022
Custom loss function forward vs. custom_loss Beginners	2	3014	August 11, 2022
Custom_loss fn for token_classification Beginners	3	344	November 6, 2024

Why are only 2 of the RT-DETR v2 implemented losses actually used?

Related topics