SegformerImageProcessor only supports uint8 masks

dsjstc · November 2, 2023, 8:06pm

I have a semantic segmentation problem with several hundred classes. It appears that SegformerImageProcessor needs to be able to trivially convert masks to an 8-bit PIL image. If I pass in an RGB PIL image as the mask, I get pixel_values.shape = (3,512,512) and labels.shape = (512,512,3), which makes me think this isn’t an intended usage.

AFAIK the only transform I need to apply to the mask is a resize, so I can easily do that on my own, but it seems like an odd limitation.
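
For reference, here is roughly what I mean by doing the resize on my own. This is only a sketch using NumPy: the helper name `resize_mask_nearest` and the random 400-class mask are made up for illustration, and it is not what the processor does internally. It uses nearest-neighbor sampling so no interpolated (nonexistent) class IDs appear, and it keeps the labels as a single-channel integer array so IDs above 255 survive.

```python
from typing import Tuple

import numpy as np


def resize_mask_nearest(mask: np.ndarray, size: Tuple[int, int]) -> np.ndarray:
    """Nearest-neighbor resize of a 2-D integer label map to (height, width).

    Hypothetical helper, not part of transformers.
    """
    if mask.ndim != 2:
        raise ValueError("expected a 2-D mask of class IDs")
    out_h, out_w = size
    in_h, in_w = mask.shape
    # Map the centre of each output pixel back to a source row/column index.
    rows = ((np.arange(out_h) + 0.5) * in_h / out_h).astype(np.int64)
    cols = ((np.arange(out_w) + 0.5) * in_w / out_w).astype(np.int64)
    rows = np.minimum(rows, in_h - 1)
    cols = np.minimum(cols, in_w - 1)
    # Integer indexing copies class IDs unchanged; no blending between classes.
    return mask[rows[:, None], cols[None, :]].astype(np.int64)


# Made-up mask with class IDs well above the uint8 range.
mask = np.random.default_rng(0).integers(0, 400, size=(600, 800), dtype=np.int32)
labels = resize_mask_nearest(mask, (512, 512))
```

The resulting `labels` array has shape (512, 512) and could then be paired with the `pixel_values` produced from the image alone, assuming the image is resized to the same 512x512 target.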

Am I misunderstanding something here?
