Changing resolution of transformer models for training

moshel · September 2, 2022, 3:53am

I would like to train a transformer on 512x512 images. I was looking at the huggingpics notebook as a starting point but failed to modify it to train on images of any size other than 224x224.
I tried passing “size” to ViTFeatureExtractor.from_pretrained (rejected as an unknown keyword) and to ViTForImageClassification.from_pretrained (accepted, but trainer.fit then fails with “ValueError: Input image size (512*512) doesn’t match model (224*224).”).

While I know how to do that in plain vanilla Keras and PyTorch, Hugging Face is just too user friendly for me!

Any help will be greatly appreciated.
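
For context on the error above, here is a minimal sketch of why a ViT checkpoint is tied to one input resolution. It assumes a patch size of 16 and a single [CLS] token (typical for ViT-base checkpoints, but not stated in the post); the helper name num_position_embeddings is hypothetical and is not part of the transformers library.

```python
def num_position_embeddings(image_size: int, patch_size: int = 16) -> int:
    """Return the number of patches in a square image, plus one for [CLS].

    patch_size=16 is an assumption; check the checkpoint's config.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be a multiple of patch_size")
    patches_per_side = image_size // patch_size
    return patches_per_side * patches_per_side + 1


# A checkpoint pretrained at 224x224 stores this many position embeddings.
pretrained = num_position_embeddings(224)
# A 512x512 input produces this many tokens, so the stored table is too small.
requested = num_position_embeddings(512)
print(pretrained, requested)
```

Because the two counts differ, the pretrained position embeddings have to be resized (for example by interpolation) or re-initialized before 512x512 inputs can be used. The ViT models in transformers accept an interpolate_pos_encoding argument in their forward pass for this purpose, though whether it fits the huggingpics training loop is not confirmed in this thread.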
