Continue training XLNet on a specific closed-domain dataset

krannnN · July 16, 2020, 9:54pm

I’m wondering what if we want to leverage the already pre-trained XLNet model (and its language knowledge) and fine-tune on a specific closed-domain dataset, say legal domain for example.

I have already corpora I’m just missing how to do this with XLNet like models.

Any thoughts on how to do that?

valhalla · July 17, 2020, 4:29am

Hi @krannnN, you can use the run_language_modelling script to fine-tune xlnet. You can fine it here. You’ll just need to provide the dataset in the required format.

krannnN · July 19, 2020, 1:39pm

thank you @valhalla, for your reply, the readme file doesn’t mention xlnet models.

export TRAIN_FILE=/path/to/dataset/wiki.train.raw
export TEST_FILE=/path/to/dataset/wiki.test.raw

python run_language_modeling.py \
    --output_dir=output \
    --model_type=xlnet\
    --model_name_or_path=xlnet \
    --do_train \
    --train_data_file=$TRAIN_FILE \
    --do_eval \
    --eval_data_file=$TEST_FILE

is there anyway to precise that I want to continue training from the last checkpoint and not do the training from scratch ?

Thanks

Topic		Replies	Views
Continue training XLNet on domain-specific data stuck in Creating features 🤗Transformers	0	349	July 24, 2020
Fine-tuning XLNet for permutation language modeling: what is the required format of the train data? 🤗Transformers	0	675	July 21, 2021
How to make a model like wav2vec or xls-r for my custom dataset and use it for fine tuneing Beginners	0	179	January 19, 2024
Pre-training & fine-tuning BERT on specific domain with custom dataset Beginners	4	4266	August 10, 2021
How can train a POS model with XLNET? Beginners	2	267	April 18, 2022

Continue training XLNet on a specific closed-domain dataset

Related topics