Hello folks,
I’m trying to get GPT-2 in Tensorflow 2.x to generate on TPUs using tf.distribute.TPUStrategy. This requires compiling the functions with tf.function, but I haven’t had any luck getting tf.function to work on a TFGPT2LMHeadModel.
Has anyone tried this, and / or had any luck?
Thanks.