I have used Google’s pretrained ‘vit-base-patch16-224-in21k’ ViT model on my image dataset.
I have sequences of images (about 1,315 images in total, so not a large dataset) that I am trying to classify as ‘human present’ vs. ‘no human present’, i.e. binary classification.
Model: “model_6”
_________________________________________________________________
Layer (type)                 Output Shape               Param #
=================================================================
input_7 (InputLayer)         [(None, 3, 224, 224)]      0
vit (TFViTMainLayer)         TFBaseModelOutputWithPool  86389248
global_average_pooling1d (Gl (None, 768)                0
dense_12 (Dense)             (None, 256)                196864
dropout_40 (Dropout)         (None, 256)                0
outputs (Dense)              (None, 1)                  257
=================================================================
Total params: 86,586,369
Trainable params: 197,121
Non-trainable params: 86,389,248
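For reference, the trainable head on top of the frozen ViT backbone can be sketched like this. It takes the backbone's sequence of 768-dim token embeddings as input (197 tokens for a 224×224 image with 16×16 patches: 196 patches plus the [CLS] token). The relu activation and 0.4 dropout rate are assumptions, since the summary only shows layer shapes, but the trainable parameter count matches:

```python
import tensorflow as tf

# Stand-in for the frozen ViT backbone's output: (batch, 197, 768) features.
features = tf.keras.Input(shape=(197, 768))
x = tf.keras.layers.GlobalAveragePooling1D()(features)
# Dense(256) on a 768-dim input: 768*256 weights + 256 biases = 196,864 params.
x = tf.keras.layers.Dense(256, activation="relu")(x)
x = tf.keras.layers.Dropout(0.4)(x)  # dropout rate is an assumption
# Output Dense(1): 256 weights + 1 bias = 257 params.
outputs = tf.keras.layers.Dense(1, activation="sigmoid", name="outputs")(x)
head = tf.keras.Model(features, outputs)
print(head.count_params())  # 197121, matching "Trainable params" above
```

So only the ~197k-parameter head is being trained; the 86M-parameter ViT stays frozen, exactly as the summary's non-trainable count shows.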
After training, I get 100% accuracy on the training, validation, and test sets!
My accuracy and loss curves look like this.
How is this possible?