Save CamemBert model wrapped in keras

Toropicana · November 2, 2020, 9:38am

Hi,
I am working on fine-tuning a CamemBERT model for a text classification task.
Here is what I am doing:
I load a TFCamembertModel from the pretrained base architecture.
I then call a function that builds the model as follows:

The layer in the model is basically the transformer outputting the CLS token used for classification.

My question is the following: how do I save the whole CamemBERT model, i.e. not just the layer, which would force me to use the CLS token for the downstream task? I want the per-word context vectors, which I would average into a single vector representing the whole text.
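To make the averaging idea concrete, here is a minimal sketch of what I have in mind. It uses plain NumPy on a toy array rather than the real model; the function name `masked_mean_pool` and the toy values are made up for illustration. In practice I assume the per-token vectors would be the model's last hidden state, with shape (batch, seq_len, hidden), and the mask would be the tokenizer's attention mask, so that padding tokens do not skew the average.

```python
import numpy as np

def masked_mean_pool(token_vectors, attention_mask):
    """Average per-token vectors over real (non-padding) tokens.

    token_vectors:  array of shape (batch, seq_len, hidden)
    attention_mask: array of shape (batch, seq_len), 1 = real token, 0 = padding
    """
    token_vectors = np.asarray(token_vectors, dtype=np.float32)
    # Add a trailing axis so the mask broadcasts over the hidden dimension.
    mask = np.asarray(attention_mask, dtype=np.float32)[..., np.newaxis]
    # Zero out padding positions, then sum over the sequence axis.
    summed = (token_vectors * mask).sum(axis=1)
    # Number of real tokens per example; clip to avoid division by zero.
    counts = np.clip(mask.sum(axis=1), 1.0, None)
    return summed / counts

# Toy example (hypothetical values): one sentence, three positions,
# the last position being padding that must be ignored.
hidden = np.array([[[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]])
mask = np.array([[1, 1, 0]])
pooled = masked_mean_pool(hidden, mask)
print(pooled.shape)  # one vector per input text
```

The resulting vector per text would then replace the CLS vector as the input to the classification head.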

Tell me if that approach makes sense, or if I should rather use the CLS token.
