[Keras] Fine-Tune Vision Transformer Model?

I’m looking for a Keras-style approach to freezing and unfreezing a Vision Transformer model. For example, with a Hugging Face vision model, I can do the following:

from transformers import TFSegformerForImageClassification as tfseg

tf_huggingface_module = tfseg.from_pretrained(
    'nvidia/mit-b0'
)
tf_huggingface_module.trainable = False

tf_huggingface_module.layers
[<transformers.models....TFSegformerMainLayer at 0x7f2ad0>,
 <keras.layers.core.Dense at 0x7f2aa6ca3650>]

Now, what if I want to freeze only a few layers, from the bottom to the middle, and unfreeze the rest? How should I do that with the Hugging Face API? FYI, in the Keras API we can do something like this:

from tensorflow.keras import layers

# Unfreeze the top 20 layers, keeping BatchNorm layers frozen
for layer in model.layers[-20:]:
    if not isinstance(layer, layers.BatchNormalization):
        layer.trainable = True
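For completeness, here is that pattern in a fully runnable form. It uses `tf.keras.applications.ResNet50` as a stand-in backbone (with random weights, just to keep the example self-contained); one caveat worth noting is that freezing per layer, rather than setting `model.trainable = False` on the whole model, keeps the model's own `trainable` flag `True` so the unfrozen layers actually contribute to `trainable_weights`:

```python
import tensorflow as tf
from tensorflow.keras import layers

# Stand-in backbone; weights=None skips downloading pretrained weights.
base_model = tf.keras.applications.ResNet50(weights=None, include_top=False)

# Freeze everything below the top 20 layers...
for layer in base_model.layers[:-20]:
    layer.trainable = False

# ...and in the top 20, keep BatchNorm frozen so its moving
# statistics are not updated during fine-tuning.
for layer in base_model.layers[-20:]:
    layer.trainable = not isinstance(layer, layers.BatchNormalization)
```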

I have found this blog post, but I need a more precise pointer.

Hi,

Refer to this answer: How to freeze GPT-2 model layers with Tensorflow/Keras? · Issue #18282 · huggingface/transformers · GitHub

Hello, it looks like I need to look at the model-building code to get the proper attribute name, for example `model.transformer.wte.trainable`. Is there any documentation regarding this, for example for the vision models in this case?

Hi,

from the link above:

To reach the layer you want to freeze, the best way is to navigate the code of the original model and find its attribute name.

So it’s advised to just check the implementation of the model. The implementation starts here. As you can see, TFViTModel contains a single attribute, “vit”. This leads to the TFViTMainLayer class. There we have the attributes “embeddings”, “encoder”, “layernorm” and “pooler”. So to freeze the weights of the embeddings for instance, you can do:

from transformers import TFViTModel

model = TFViTModel.from_pretrained("google/vit-base-patch16-224")

model.vit.embeddings.trainable = False

Thank you for the detailed answer. Yes, I understood that from the previous comments, but I was expecting some kind of textual documentation on this. IMO, it’s a little weird to have to go to the source code for the relevant attribute name.