Inference with SegFormer

I fine-tuned nvidia/mit-b5 for semantic segmentation. Training went well, but during inference the model predicts the whole picture as a single mask. I can't figure out what I did wrong.
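For context, the logits come from a forward pass that looks roughly like this (simplified sketch; the checkpoint path and image file name are placeholders for my actual ones):

import numpy as np
import torch
from torch import nn
from PIL import Image
from transformers import SegformerImageProcessor, SegformerForSemanticSegmentation

# "path/to/my-finetuned-mit-b5" and "example.jpg" are placeholders
processor = SegformerImageProcessor.from_pretrained("path/to/my-finetuned-mit-b5")
model = SegformerForSemanticSegmentation.from_pretrained("path/to/my-finetuned-mit-b5")
model.eval()

image = np.array(Image.open("example.jpg").convert("RGB"))  # (H, W, 3)
inputs = processor(images=image, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # (1, num_labels, H/4, W/4)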

First, rescale the logits to the original image size:

upsampled_logits = nn.functional.interpolate(
    logits,
    size=image.shape[:-1],  # (height, width) of the original image
    mode='bilinear',
    align_corners=False,
)

Second, apply argmax along the class dimension:

seg = upsampled_logits.argmax(dim=1)[0]  # (H, W) map of predicted class indices
seg = seg.cpu().numpy()
print(np.unique(seg, return_counts=True))

There is only one value, 1, for the whole image, i.e. every pixel is assigned to class 1.
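In case it helps with diagnosing this, here is a quick sanity check I can run on the raw logits before upsampling (just a diagnostic sketch using the variables from the snippets above): if one channel already dominates everywhere, the collapse happens at the model output rather than in the post-processing.

# Inspect the raw logits before upsampling
print(logits.shape)                       # expected (1, num_labels, H/4, W/4)
print(logits[0].mean(dim=(1, 2)))         # mean logit per class channel
print(logits.argmax(dim=1).unique(return_counts=True))  # class counts at 1/4 resolution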

Can someone please help me? Thanks