Proper way to swap backbones from baseline model

Cubpaw · July 6, 2024, 2:35pm

Hi, I’m currently training a segmentation model, and this is how I initiate my model and change my backbone before the training:

if config[“model_architecture”].lower() == “upernet”:
model_args[“loss_ignore_index”] = model_args.pop(“semantic_loss_ignore_index”)
del model_args[“image_size”]
MODEL_INITIALIZER = UperNetForSemanticSegmentation

Load the new backbone from the checkpoint
if config[“backbone_checkpoint_model”] is not None:
new_backbone = AutoBackbone.from_pretrained(new_backbone_dir, out_indices=(1,2,3,4))
# Replace the existing backbone with the new one
model.backbone = new_backbone
# Optionally, if you need to adjust the configuration
model.config.backbone_config = new_backbone.config

For the other backbones (convnext, convnextv2) this approach seems to work without any issues, but for swinv2, even though the training runs without any errors (I guess this means at least the dimensions match correctly) swinv2 model does not seem to learn under this setting.

Is this the way how the backbones are usually swapped? or am I doing it wrong?
Any sort of input would be appreciated! Thanks!

Topic		Replies	Views
How to Modify UperNetForSemanticSegmentation from 150 Classes to Binary Classes While Retaining Pre-Trained Weights Models	0	54	September 4, 2024
Loading only pre-trained backbone for Mask2Former 🤗Transformers	0	207	April 8, 2024
KeyError Convert SWIN to Pytorch 🤗Transformers	0	210	August 9, 2023
Using detr with custom backbone Models	3	623	December 6, 2024
Swin Transformer for segmentation Beginners	1	2151	November 3, 2022

Proper way to swap backbones from baseline model

Related topics