Unable to load model when using more_general_trainer_metric branch

saied · September 24, 2021, 8:28am

Hey everyone,
I tried to use the code from bert2gpt2-cnn_dailymail-fp16 to train different models, but when I try to load the GPT-2 model:

model = EncoderDecoderModel.from_encoder_decoder_pretrained("HooshvareLab/bert-fa-base-uncased", "flax-community/gpt2-medium-persian")

I get this:

OSError: Can't load config for 'flax-community/gpt2-medium-persian'. Make sure that:

- 'flax-community/gpt2-medium-persian' is a correct model identifier listed on 'https://huggingface.co/models'

I also tried to clone the model repo manually and load it from the local directory, but I got this error:

OSError: Unable to load weights from pytorch checkpoint file. If you tried to load a PyTorch model from a TF 2.0 checkpoint, please set from_tf=True.
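
For the local-clone case, one thing that may be worth ruling out (an assumption, not something confirmed for this repo) is that the clone was made without git-lfs, so the weight files are only small Git LFS pointer text files, or that the repo does not ship a PyTorch weight file at all. A minimal sketch that checks a cloned directory for the standard transformers weight file names; the helper name `inspect_checkpoint_dir` and the example directory name are hypothetical:

```python
from pathlib import Path

# First bytes of a Git LFS pointer file (left in place of the real
# file when a repo is cloned without git-lfs installed).
LFS_HEADER = b"version https://git-lfs.github.com/spec/v1"

# Standard weight file names used by transformers for PyTorch, Flax and TF.
WEIGHT_FILES = ("pytorch_model.bin", "flax_model.msgpack", "tf_model.h5")


def inspect_checkpoint_dir(model_dir):
    """Hypothetical helper: classify each known weight file in model_dir
    as 'missing', 'lfs-pointer', or 'present'."""
    report = {}
    for name in WEIGHT_FILES:
        path = Path(model_dir) / name
        if not path.is_file():
            report[name] = "missing"
            continue
        # Only the first few bytes are needed to recognise a pointer file.
        with path.open("rb") as fh:
            head = fh.read(len(LFS_HEADER))
        report[name] = "lfs-pointer" if head == LFS_HEADER else "present"
    return report
```

Calling `inspect_checkpoint_dir("gpt2-medium-persian")` on the cloned directory (path assumed) returns a dict per file. If `pytorch_model.bin` shows up as `lfs-pointer` or `missing`, that could explain the second error, though other causes are possible.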

Note that I am using Colab and installed the more_general_trainer_metric branch of transformers with this command:

!pip install git+git://github.com/huggingface/transformers.git@more_general_trainer_metric

Any help would be appreciated
Thanks
