[Tensorflow Export] How to export a fine tuned GPT2 model to a tensorflow model file?

farazk86 · January 14, 2021, 11:25pm

Sorry if this is a silly question, but I cant seem to find any proper solution to this.

I am using transformers==2.8.0 and have fine-tuned a gpt2 model with my own dataset. I know that during training it creates checkpoints in pytorch and that can be used for text generation, but I want to save/load model in tensorflow.

I know that TFGPT2LMHeadModel exists and that it can be used, but I havent found an example online doing this.

Can someone help me please? How can I export a fine-tuned model into tensorflow, so that I can then generate text using that model?

Thanks

sgugger · January 15, 2021, 2:20pm

You can check this part of the docs that shows how to reload a model trained with PyTorch into TensorFlow (and vice versa).

Topic		Replies	Views
How to load finetuned model in TF Beginners	2	450	September 28, 2020
Loading finetuned model to generate text 🤗Transformers	12	3315	August 7, 2023
Finetune GPT2 in tensorflow on custom data example programmatically Beginners	0	487	July 23, 2020
Fine-tuning GPT2 for text-generation with TensorFlow Beginners	4	5699	July 24, 2022
GPT2 with TensorFlow? 🤗Transformers	1	372	November 14, 2020

[Tensorflow Export] How to export a fine tuned GPT2 model to a tensorflow model file?

Related topics