Can I use Dataloader for image and text processing with ViltProcessor?

shantu95 · January 11, 2023, 8:37pm

I want to fine-tune ViLT (a vision-language model) for my task. In my dataset, each example consists of 10 images and 1 text. For ViltForImagesAndTextClassification, I can increase the number of images through ViltConfig, but I am not able to preprocess the dataset with ViltProcessor inside a DataLoader.

Is it possible to pass images and text to ViltProcessor in batches? If so, could anyone show me how to do that?

Thanks in advance.
