Any examples on VisualBERTforMultipleChoice

Petrus · March 1, 2022, 7:12am

Hi,

Would anyone know any examples on how to use VisualBERTforMultipleChoice, or any similar examples? I am mostly looking for an example that can showcase how I need to tokenize my text data and perform visual feature extraction of my images, as well as how to input my multi-class labels to the model.

I would like to build something similar to this paper using radiology images and text reports and train a model to predict 14 classes (thoracic diagnosis):

Here is the hugging face transformer model I plan to use:

If someone would have a good example on how to do this with hugging face please share. Thanks!

Petrus · March 3, 2022, 9:16am

Actually, if anyone would be able to share an example of a vision language transformer model? Preferably trained on a multi-classification problem, but any task would be helpful. Even better, if it uses the SageMaker API. Thank you!

Topic		Replies	Views
How to combine images and text in SageMaker Amazon SageMaker	2	2272	October 13, 2022
NLP Sense Making Beginners	0	421	March 31, 2022
Resources for Sign Language Translation Beginners	0	1650	August 18, 2020
Suggestions for hugging face transformer models for Code and Formal Languages Intermediate	2	1754	May 3, 2022
Multimodal transformer Models	0	1069	April 23, 2023

Any examples on VisualBERTforMultipleChoice

Related topics