VisualBert Embeddings

torium · April 13, 2024, 3:38pm

Hi! I am trying to fine tune VisualBert for a classification task but right now it is randomly predicting only one of the two classes that I have. I am thinking that it might be the way I am retrieving the visual embeddings. I am using resnet50 and I get the features from this line of code:
detector = torchvision.models.resnet50(pretrained=True) detector = torch.nn.Sequential(*list(detector.children())[:-1])

does anyone know if these embeddings actually work with VisualBert? I read that it typically needs embeddings from an object detector but since I am only classifying image-sentence pairs I thought that this network could also work. Thanks!

Topic		Replies	Views
VisualBert model producing RuntimeError 🤗Transformers	7	458	December 22, 2023
VisualBert for meme classification Beginners	0	250	December 7, 2021
Sentence Embeddings From Fine-Tuned BERTForSequenceClassification 🤗Transformers	1	1682	September 29, 2021
Any examples on VisualBERTforMultipleChoice 🤗Transformers	1	415	March 3, 2022
Run detectron2 for feature extraction in SageMaker notebook Amazon SageMaker	8	2259	March 16, 2022

VisualBert Embeddings

Related topics