Using .generate with TAPAS as encoder in EncoderDecoder

per · January 13, 2022, 12:35pm

Hi,
I’m trying to train a model to perform text generation conditioned on tables.

Since TAPAS can encode the semi-structured meaning in tables, I figured it would be a good choice for the encoder, with, say, GPT2 (or any other CLM) as the decoder.

However, I ran into an error when trying to generate from that EncoderDecoder model.

I guess this happens because model.generate() for EncoderDecoder does not expect the extra dimension that TAPAS's token_type_ids have.
Can anyone think of a way I can make this work?

Thanks!

per · January 17, 2022, 2:04pm

Hi, does anyone know?
@NielsRogge?

nielsr · January 17, 2022, 3:03pm

Hi,

I’ll investigate and get back to you.

nielsr · January 17, 2022, 4:02pm

Update: it works for me when overriding the _update_model_kwargs_for_generation method. The token_type_ids shouldn’t be updated, as a table only needs to get encoded once.

Notebook: Google Colab
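
For readers without access to the notebook, here is a minimal sketch of what such an override could look like. It is not the notebook's code. It assumes the default _update_model_kwargs_for_generation extends token_type_ids by one position per generated token, which would not fit TAPAS's extra token-type dimension. KeepTokenTypeIdsMixin and the Toy* classes are hypothetical names; ToyGenerationBase is only a stand-in so the snippet runs without transformers installed.

```python
class KeepTokenTypeIdsMixin:
    """Hypothetical mixin: leave token_type_ids untouched between decoding steps."""

    def _update_model_kwargs_for_generation(self, outputs, model_kwargs, *args, **kwargs):
        # Take token_type_ids out so the parent implementation cannot extend them.
        token_type_ids = model_kwargs.pop("token_type_ids", None)
        model_kwargs = super()._update_model_kwargs_for_generation(
            outputs, model_kwargs, *args, **kwargs
        )
        # Put the original table token types back unchanged:
        # the table only needs to be encoded once.
        if token_type_ids is not None:
            model_kwargs["token_type_ids"] = token_type_ids
        return model_kwargs


class ToyGenerationBase:
    """Stand-in for the library base class (assumption, not the real implementation)."""

    def _update_model_kwargs_for_generation(self, outputs, model_kwargs, **kwargs):
        # Assumed default behaviour: append one more token type per generated token.
        ids = model_kwargs.get("token_type_ids")
        if ids is not None:
            model_kwargs["token_type_ids"] = ids + [ids[-1]]
        # Carry the decoder cache forward to the next step.
        model_kwargs["past"] = outputs.get("past")
        return model_kwargs


class ToyTapasEncoderDecoder(KeepTokenTypeIdsMixin, ToyGenerationBase):
    """Toy model combining the mixin with the stand-in base."""


if __name__ == "__main__":
    table_token_types = [[0] * 7, [1] * 7]  # two tokens, 7 type ids each, TAPAS-style
    model = ToyTapasEncoderDecoder()
    updated = model._update_model_kwargs_for_generation(
        {"past": "cache"}, {"token_type_ids": table_token_types}
    )
    print(len(updated["token_type_ids"]))
```

With the real library, the same mixin would presumably be combined with the actual model class, e.g. class TapasEncoderDecoder(KeepTokenTypeIdsMixin, EncoderDecoderModel). The *args/**kwargs pass-through is there because the method's exact signature may differ between transformers versions.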

per · January 18, 2022, 5:22pm

It works for me!
I’ll now go ahead and try to train it for conditional generation.
Thanks!
