VITMAE training

vipinbansal1 · August 13, 2024, 3:45pm

Hi,
I am trying to perform incremental training on VITMAE using my own dataset, which has around 150 images.
I am only training the ViT autoencoder model, and I have explored all three variants with different numbers of epochs.
I have tried transfer learning, i.e. starting from the pretrained weights and continuing training on my data, and I have also trained from randomly initialized weights.

The issue I am facing is that the generated images are not smooth: the boundaries of each sub-block (patch) of the image are clearly visible.
I am not sure how to fix this. I am exploring overlapping grids and would appreciate expert advice.
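
To make the overlapping-grid idea concrete, here is a minimal sketch of what I have in mind. It is not part of VITMAE itself. It assumes a hypothetical callable `reconstruct_tile` that stands in for a forward pass of the trained model on one tile, and the function name, `tile`, and `stride` are illustrative choices. Tiles are taken with a stride smaller than the tile size, and overlapping outputs are averaged, so block boundaries from one tile fall inside the interior of another. Whether this actually removes the visible seams is untested.

```python
import numpy as np


def blend_overlapping_tiles(image, reconstruct_tile, tile=224, stride=112):
    """Reconstruct `image` tile by tile and average where tiles overlap.

    `reconstruct_tile` is a hypothetical callable mapping an array of shape
    (tile, tile, C) to an array of the same shape; it stands in for the model.
    """
    h, w, c = image.shape
    if h < tile or w < tile:
        raise ValueError("image must be at least one tile in each dimension")

    acc = np.zeros((h, w, c), dtype=np.float64)     # sum of tile outputs
    weight = np.zeros((h, w, 1), dtype=np.float64)  # tiles covering each pixel

    # Tile origins; the last origin is clamped so the border is always covered.
    ys = sorted(set(list(range(0, h - tile + 1, stride)) + [h - tile]))
    xs = sorted(set(list(range(0, w - tile + 1, stride)) + [w - tile]))

    for y in ys:
        for x in xs:
            out = reconstruct_tile(image[y:y + tile, x:x + tile])
            acc[y:y + tile, x:x + tile] += out
            weight[y:y + tile, x:x + tile] += 1.0

    # Every pixel is covered at least once, so the division is safe.
    return acc / weight
```

Choosing a stride that is not a multiple of the model's patch size shifts the patch grid between tiles, which is the part I expect to help most. A weighted window (for example, one that tapers toward the tile edges) could replace the uniform average.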
