Image classification: Why use both a transform and a processor to preprocess images?

Hi @Steyn-vanLeeuwen, I'm quoting from the tutorial:

> You might wonder why we pass along the image_processor as a tokenizer when we already preprocessed our data. This is only to make sure the image processor configuration file (stored as JSON) will also be uploaded to the repo on the hub.

I don't think you're doing two different preprocessing steps. You're just reading a few values from `image_processor` and passing them to two other functions:

```python
from torchvision.transforms import Normalize

normalize = Normalize(mean=image_processor.image_mean, std=image_processor.image_std)
if "height" in image_processor.size:
    size = (image_processor.size["height"], image_processor.size["width"])
    crop_size = size
    max_size = None
elif "shortest_edge" in image_processor.size:
    size = image_processor.size["shortest_edge"]
    crop_size = (size, size)
    max_size = image_processor.size.get("longest_edge")
```
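If it helps to see both branches in isolation, here's a minimal, self-contained sketch of that size-selection logic, with plain dicts standing in for `image_processor.size` (the example values are hypothetical, not taken from any specific checkpoint):

```python
def resolve_sizes(size_cfg):
    """Mimic how the transform picks size/crop_size from image_processor.size."""
    if "height" in size_cfg:
        # Processors with a fixed output resolution (e.g. ViT-style configs).
        size = (size_cfg["height"], size_cfg["width"])
        crop_size = size
        max_size = None
    elif "shortest_edge" in size_cfg:
        # Processors that resize by shortest edge, optionally capped by longest_edge.
        size = size_cfg["shortest_edge"]
        crop_size = (size, size)
        max_size = size_cfg.get("longest_edge")
    return size, crop_size, max_size

print(resolve_sizes({"height": 224, "width": 224}))  # ((224, 224), (224, 224), None)
print(resolve_sizes({"shortest_edge": 256}))         # (256, (256, 256), None)
```

Either way, the transform ends up normalizing and cropping with exactly the numbers the image processor would use, so the two stay consistent.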

For inference you can apply just the `image_processor`, as explained in the tutorial.
