How to reproduce XLNet correctly And What is the config for finetuning XLNet?

YunpengTai · July 30, 2021, 12:55pm

I fintune a XLNet for English text classification. But it seems that I did something wrong about it because xlnet-base is worse than bert-base in my case. I set every 1/3 epoch report validation accuracy. At the beginning Bert-base is about 0.50 while XLNet-base is only 0.24. The config I use for xlnet is listed as follows:

config = {
  batch_size = 4,
  learning_rate = 1e-5,
  gradient_accumulation_steps =  32,
  epochs = 4,
  max_sep_length = 384,
  weight_decay = 0.01,
  adam_epsilon = 1e-6,
  16-bit_training = False
}

Does finetune XLNet needs a special setting or XLNet converges slowly?

Thanks for everyone willing to help in advance!

Topic		Replies	Views
Fine-tuning XLNet for permutation language modeling: what is the required format of the train data? 🤗Transformers	0	675	July 21, 2021
Fine-Tune Xlm-roberta-large-xnli 🤗Transformers	1	1922	December 28, 2021
Can't reproduce xlm-roberta-large finetuned result on XNLI 🤗Transformers	2	1918	March 10, 2021
Continue training XLNet on domain-specific data stuck in Creating features 🤗Transformers	0	349	July 24, 2020
Continue training XLNet on a specific closed-domain dataset Beginners	2	592	July 19, 2020

How to reproduce XLNet correctly And What is the config for finetuning XLNet?

Related topics