Hi,
I was following this tutorial to train an LM from scratch: "How to train a new language model from scratch using Transformers and Tokenizers".
The result is a PyTorch model, but I need one for TensorFlow. Is there an easy way to convert it?
I tried to modify the training code to use TFTrainer and TFBertForModelLM instead, but TFTrainer has trouble with the data collator and the LineByLineTextDataset object.
When initializing the trainer with the data collator, I get the error: __init__() got an unexpected keyword argument 'data_collator'
When calling trainer.train() (without the collator), I get the error: 'LineByLineTextDataset' object has no attribute '_variant_tensor'.