Fine tuning image transformer on higher resolution

Hi one can achieve this as explained here: Fine-tuning ViT with more patches/higher resolution - #4 by mohotmoz