When we fine-tune an object detection model, which weights exactly are we updating?
Do we explicitly configure which components of the model we want to update?
For example, in fine-tuning DETR, which components are we updating?
I would assume that the backbone is frozen, the encoder and decoder are updated, and the prediction heads are reset and trained from scratch.
(I am asking specifically about DETR because it is a popular and simple object detection model.)
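
To make my assumption concrete, here is a minimal PyTorch sketch of the setup I have in mind. `ToyDetector` and its attribute names (`backbone`, `encoder`, `decoder`, `class_head`, `bbox_head`) are hypothetical stand-ins I made up for illustration, not the real DETR implementation, and `prepare_for_finetuning` is likewise my own helper. It only shows the freezing and resetting mechanics, which may or may not match what actual fine-tuning scripts do.

```python
import torch
from torch import nn


class ToyDetector(nn.Module):
    """Hypothetical stand-in for a DETR-style model (not the real DETR code)."""

    def __init__(self, hidden_dim=32, num_classes=91):
        super().__init__()
        self.backbone = nn.Conv2d(3, hidden_dim, kernel_size=3, padding=1)
        self.encoder = nn.Linear(hidden_dim, hidden_dim)
        self.decoder = nn.Linear(hidden_dim, hidden_dim)
        # +1 output for the "no object" class
        self.class_head = nn.Linear(hidden_dim, num_classes + 1)
        self.bbox_head = nn.Linear(hidden_dim, 4)


def prepare_for_finetuning(model, num_new_classes, lr=1e-4):
    # 1) Freeze the backbone: its weights receive no gradient updates.
    for param in model.backbone.parameters():
        param.requires_grad = False

    # 2) Reset the prediction heads. The class head is replaced because the
    #    number of classes changes; the box head is re-initialized in place.
    hidden_dim = model.class_head.in_features
    model.class_head = nn.Linear(hidden_dim, num_new_classes + 1)
    model.bbox_head.reset_parameters()

    # 3) Hand only the trainable parameters (encoder, decoder, heads)
    #    to the optimizer.
    trainable = [p for p in model.parameters() if p.requires_grad]
    return torch.optim.AdamW(trainable, lr=lr)


model = ToyDetector()
optimizer = prepare_for_finetuning(model, num_new_classes=5)

# List which weights would actually be updated during fine-tuning.
trainable_names = [n for n, p in model.named_parameters() if p.requires_grad]
print(trainable_names)
```

In this sketch the choice of what to update is fully explicit: anything with `requires_grad = False` or left out of the optimizer stays fixed. Is that also how it works in practice for DETR, or is there a different default?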
Thanks!