How to fine-tune models with my own dataset in TensorFlow?

I am trying to follow the tutorial here, but I want to use my own dataset.
I have stored the texts and labels in a pandas DataFrame named train, which has only two columns: text and labels.
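For reference, a toy stand-in for train looks like this (the rows here are made up for illustration; my real data just has the same two-column shape):

```python
import pandas as pd

# Toy version of my DataFrame: one text column, one integer label column
train = pd.DataFrame({
    "text": ["great movie", "terrible plot", "loved it"],
    "labels": [1, 0, 1],
})

print(list(train.columns))  # → ['text', 'labels']
```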
I have tried the following code:

train_tokenized = tokenizer(list(train.text), padding="max_length", truncation=True, return_tensors="tf")
train_features = {x: train_tokenized[x] for x in tokenizer.model_input_names}
train_tf_data = tf.data.Dataset.from_tensor_slices((train_features, train.labels))
train_tf_data = train_tf_data.batch(8)
model.fit(train_tf_data, epochs=3)

But it fails with: ValueError: Unsupported type BatchEncoding returned by IteratorSpec._serialize