LM from Scratch for Tensorflow

jojo2k · January 15, 2021, 5:50pm

Hi,
I was following this tutorial to train a LM from scratch: How to train a new language model from scratch using Transformers and Tokenizers
The result is a pytorch model. Though I need one for Tensorflow. Is there an easy way to convert it?

I tried to modify the training code by using TFTrainer, TFBertForModelLM instead. but TFTrainer is causing trouble with the Data_collator and LineByLineTextDataset objects.
When intitializing the trainer with the data collator i get the error: init() got an unexpected keyword argument ‘data_collator’
When calling trainer.train() (without collator) I receive the error: LineByLineTextDataset object has no attribute ‘_variant_tensor’ .

Jung · January 16, 2021, 3:05am

Hi! Here’s a nice example of custom TF MLM learning on XLM-Roberta with Kaggle TPU

https://www.kaggle.com/riblidezso/finetune-xlm-roberta-on-jigsaw-test-data-with-mlm

jojo2k · January 18, 2021, 1:30pm

Thank you! Will check that link later. Sometimes I wonder why I don’t generally start my research on kaggle Has been a helpful source quite a few times.

Topic		Replies	Views
Training RoBERTa from scratch: error? 🤗Transformers	0	589	August 26, 2021
Preparing a nlp dataset for MLM 🤗Datasets	4	6145	November 8, 2024
Fine-tuning XLM-RoBERTa for binary sentiment classification Beginners	1	1437	November 4, 2021
Upload a TF model to Huggingface Intermediate	6	1065	September 1, 2021
Masked Language Modeling (MLM) using TFBertForMaskedLM (Tensorflow) 🤗Transformers	4	590	January 21, 2021

LM from Scratch for Tensorflow

Related topics