How to load finetuned model in TF

georgedittmar · September 28, 2020, 5:54am

So I trained a new GPT-2 model using the language modeling script, and it outputs a pytorch_model.bin file that I assume is the model. How do I go about converting it to TF 2.0? I see there is a convert script, but I am not sure whether I can just run it to make the model TF compatible.

sgugger · September 28, 2020, 2:30pm

You should create your model like this:

from transformers import TFGPT2Model
model = TFGPT2Model.from_pretrained("path_to_dir", from_pt=True)

where path_to_dir should be replaced with the path to the directory containing your PyTorch model (the directory you passed to GPT2Model.save_pretrained()).

Then you can use your TF model and save it with the save_pretrained method.
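
For reference, the load-then-save steps described above could be wrapped up roughly as follows. This is only a sketch: the function names and both directory arguments are made up for illustration, and the file check assumes the checkpoint uses the config.json plus pytorch_model.bin layout mentioned in this thread (newer transformers versions may write the weights under a different file name, in which case the check would need adjusting).

```python
from pathlib import Path


def find_missing_checkpoint_files(pt_dir):
    """Return the expected checkpoint files that are absent from pt_dir.

    Hypothetical helper: the expected file names are an assumption based on
    the pytorch_model.bin layout discussed in the thread.
    """
    expected = ("config.json", "pytorch_model.bin")
    return [name for name in expected if not (Path(pt_dir) / name).is_file()]


def convert_pt_checkpoint_to_tf(pt_dir, tf_dir):
    """Load a PyTorch GPT-2 checkpoint as a TF model and save it to tf_dir."""
    # Fail early with a readable message if the directory looks wrong.
    missing = find_missing_checkpoint_files(pt_dir)
    if missing:
        raise FileNotFoundError(f"{pt_dir} is missing: {missing}")

    # Imported here so the file check above runs even without TensorFlow.
    from transformers import TFGPT2Model

    # from_pt=True asks transformers to convert the PyTorch weights on load.
    model = TFGPT2Model.from_pretrained(pt_dir, from_pt=True)

    # Save in TF format so later loads no longer need from_pt=True.
    model.save_pretrained(tf_dir)
    return model
```

After this, TFGPT2Model.from_pretrained(tf_dir) should load the saved TF weights directly. If you want to generate text rather than extract hidden states, TFGPT2LMHeadModel is the class that includes the language modeling head and can be swapped in the same way.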

georgedittmar · September 28, 2020, 8:15pm

Awesome, I’ll give that a try. I missed the from_pt param.
