Is this the right way prompt summarization with BART?

css · May 25, 2022, 1:59pm

Hello!

I recently figured out a way to prompt the summarization of Encoder/Decoder models like BART using the generate() function. Normally the only prompting we get with this function appears to be the starting token for the generation (decoder_start_token_id).

However, we can see that the GenerationMixin does accept a **kwargs which gets forwarded to the underlying model!

In BART’s case, we have another parameter to its forward, decoder_input_ids, which looks like it allows you to set a custom Q vector for the decoder, for that function call.

Using these together, we can appear to make generate() start the beam search given the encoded input, and the prompt to the decoder:

My question is: Is this process doing what I think it is? The results seem to make sense, and I can loosely verify this method by prompting with only "<s>", which gives identical output to the defaults for generate(). Is this a good way to prompt summarization for the BART model?

Thanks in advance!

Thang · March 18, 2023, 9:05pm

It seems you did have an excellent way of using prompts, according to your example. I am new to this and like to hear do you have any new progress on this?

Topic		Replies	Views
Using generate() method with decoder Models	0	570	January 16, 2022
Using the decoder half of BART for causal generation Models	4	2799	May 2, 2022
How to use BART as an encoder and a decoder separately for summarization? 🤗Transformers	1	821	September 22, 2021
How to properly prompt the decoder? 🤗Transformers	0	833	May 20, 2023
Multi-decoder text generation with BART 🤗Transformers	0	627	June 7, 2021

Is this the right way prompt summarization with BART?

Related topics