Why does the TrOCR processor have a feature extractor?

nielsr · November 17, 2021, 8:31am

Hi,

Yes, models that take pixel values as input have a feature extractor defined, which applies some basic image preprocessing (typically resizing the image to a fixed size and normalizing the color channels).

TrOCR, for instance, expects every image to be of size 384x384.
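To make "resize + normalize" concrete, here is a minimal NumPy sketch of that kind of preprocessing. It is not the library's implementation: the `preprocess` function is hypothetical, the nearest-neighbour resize stands in for the proper interpolation a real feature extractor uses, and the target size (384) and the mean/std of 0.5 are assumptions you should check against the preprocessor config of the checkpoint you use.

```python
import numpy as np

# Assumed values for illustration only; check your checkpoint's preprocessor config.
TARGET_SIZE = 384
MEAN = 0.5
STD = 0.5


def preprocess(image: np.ndarray, size: int = TARGET_SIZE) -> np.ndarray:
    """Resize an HxWx3 uint8 image to size x size and normalize the channels."""
    height, width, _ = image.shape
    # Nearest-neighbour resize: pick one source row/column per target pixel.
    rows = np.arange(size) * height // size
    cols = np.arange(size) * width // size
    resized = image[rows][:, cols]
    # Rescale to [0, 1], then shift and scale to [-1, 1].
    scaled = resized.astype(np.float32) / 255.0
    normalized = (scaled - MEAN) / STD
    # Channels-first layout (3, size, size), the usual layout for pixel values.
    return normalized.transpose(2, 0, 1)


# A random 64x256 "text line" image stands in for a real scan.
dummy = np.random.default_rng(0).integers(0, 256, size=(64, 256, 3), dtype=np.uint8)
pixel_values = preprocess(dummy)
print(pixel_values.shape)
```

Whatever the input resolution or aspect ratio, the output always has the same fixed shape, which is what the model needs.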

Note that many models perform better when image augmentations (such as random flipping, cropping, etc.) are applied during training. These are not included in the feature extractors; for that, you can use packages like torchvision or albumentations.
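As a rough illustration of where augmentation sits, here is a small NumPy sketch of a random crop followed by a random horizontal flip, applied to a training image before it goes through the feature extractor. The function names and the 8-pixel crop margin are made up for this example; in practice you would use the ready-made transforms from torchvision or albumentations, and pick augmentations that make sense for your data.

```python
import numpy as np


def random_crop(image: np.ndarray, crop_h: int, crop_w: int, rng: np.random.Generator) -> np.ndarray:
    """Cut a crop_h x crop_w window at a random position."""
    height, width, _ = image.shape
    top = rng.integers(0, height - crop_h + 1)
    left = rng.integers(0, width - crop_w + 1)
    return image[top:top + crop_h, left:left + crop_w]


def random_horizontal_flip(image: np.ndarray, rng: np.random.Generator, p: float = 0.5) -> np.ndarray:
    """Mirror the image left-to-right with probability p."""
    if rng.random() < p:
        return image[:, ::-1]
    return image


def augment(image: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Training-time only: crop away a random 8-pixel margin, then maybe flip."""
    cropped = random_crop(image, image.shape[0] - 8, image.shape[1] - 8, rng)
    return random_horizontal_flip(cropped, rng)


rng = np.random.default_rng(0)
train_image = np.zeros((64, 256, 3), dtype=np.uint8)
augmented = augment(train_image, rng)
print(augmented.shape)
```

The augmented image is then passed to the feature extractor as usual, so the resize and normalization still happen afterwards. At evaluation time you skip `augment` and feed the original image directly.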
