Model predicting with fine-tuned model with Keras

IonVSmlnlp · June 13, 2022, 4:21pm

Hi!
I am implementing the steps provided by the book “Natural Language Processing with Transformers” (by Lewis Tunstall, Leandro von Werra and Thomas Wolf) for text classification; so, I am referring to the chapter 2, “Text classification”.

The fine-tuning I chose to implement was the one suggested at p.50 (Fine-Tuning with Keras).
The first thing I had to do in order for the model to work was to provide a collate_fn argument to the to_tf_dataset() function, because Keras, apparently, has changed the required arguments since the publishing of the original version book.

The code I used was the following (I followed almost the same steps presented by the book):
from transformers import TFAutoModelForSequenceClassification
tf_model=(TFAutoModelForSequenceClassification.from_pretrained(model_cpkt,num_labels=num_labels)) #p.46 #of the book

tokenizer_columns=tokenizer.model_input_names

from transformers import DataCollatorWithPadding # Implemented by myself given that the collate_fn #argument is required

data_collator = DataCollatorWithPadding(tokenizer=tokenizer, return_tensors=“tf”)

tf_train_dataset=emotions_encoded[‘train’].to_tf_dataset(columns=tokenizer_columns,label_cols=[“label”],shuffle=True,batch_size=16,collate_fn=data_collator)
tf_eval_dataset=emotions_encoded[‘validation’].to_tf_dataset(columns=tokenizer_columns,label_cols=[“label”],shuffle=False,batch_size=16,collate_fn=data_collator)

tf_model.compile(
optimizer=tf.keras.optimizers.Adam(learning_rate=5e-5),
loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
metrics=tf.metrics.SparseCategoricalAccuracy()
)

tf_model.fit(tf_train_dataset,validation_data=tf_eval_dataset,epochs=2)

The model was trained and returned a good accuracy (above 0,90). Now, my question is the following: what is the syntax for predicting the classes of new texts, for example for the tf_eval_dataset PrefetchDataset element ? I tried many configurations for the input tensor, but all failed. I always used the tf_model.predict() to try to predict new elements.

Second question: what would be the code to predict a given sentence already encoded into a variable ? Will I still have to use the Dataset functions in order to properly encode a given string ?

Thank you !
Ion

Topic		Replies	Views
Fine tune Transformers for text generation 🤗Transformers	11	12035	July 27, 2023
Am I doing this right? Beginners	1	509	July 12, 2020
Cannot get DataCollator to prepare tf dataset 🤗Transformers	0	477	July 15, 2022
Error in model.prepare_tf_dataset Beginners	4	250	June 14, 2024
How to use transformers&tensorflow for batch inference Beginners	0	528	August 20, 2021

Model predicting with fine-tuned model with Keras

Related topics