Can you provide an example of best practices for incorporating a pretrained HuggingFace Vision Transformer (ViT) into a PyTorch Lightning module?

While there are numerous examples and notebooks showing how to run and fine-tune pretrained models like Vision Transformers (ViT), I’m looking for a clear example of how to integrate a pretrained ViT into a PyTorch Lightning pipeline. Specifically:

  • Should I instantiate the AutoImageProcessor (via AutoImageProcessor.from_pretrained) inside my pl.LightningModule, or would it be better to do so in my pl.LightningDataModule?
  • Should I implement my own forward method in the LightningModule, or should I simply delegate to the forward method of the pretrained model (stored as an attribute of my Lightning class)?