Hello there!
I am trying to adapt the official Google Colab for language generation to tensorflow
and everything seems to work wonderfully by simply appending TF
to most of the huggingface
function calls (TFAutoModel
, etc)
Unfortunately, this strategy fails at the training
step:
from transformers import TFTrainer, TFTrainingArguments
import tensorflow as tf
training_args = TFTrainingArguments(
"test-clm",
evaluation_strategy = "epoch",
learning_rate=2e-5)
trainer = TFTrainer(
model=model,
args = training_args,
train_dataset=lm_datasets[0:1000],
eval_dataset=lm_datasets[1000:])
trainer.train()
self.num_train_examples = self.train_dataset.cardinality().numpy()
AttributeError: 'dict' object has no attribute 'cardinality'
I have absolutely no idea what this cardinality
is. Do you know what the issue can be?
Thanks!