How to generate a sequence using inputs_embeds instead of input_ids?

Euna · March 4, 2021, 9:56am

Hello, I am struggling with generating a sequence of tokens using model.generate() with inputs_embeds.
For my research, I have to use inputs_embeds (word embedding vectors) instead of input_ids (token indices) as an input to the GPT2 model.
I want to employ model.generate() which is a convenient tool for generating a sequence of tokens, but there is no argument for inputs_embeds. I tried to edit " transformers.generation_utils", but it was not easy to figure out which lines I should change.

Is there any idea that I can easily generate tokens with default settings for hyper-paremeters as in model.generate()? If there is any idea, help me please.

StackSmasher · October 18, 2021, 2:01pm

Were you able to figure something out for this ? My needs are similar. I’m using BART model

sadakmed · October 18, 2021, 10:07pm

@StackSmasher,

consider you have the tensor inputs_embeds which I believe will be in the shape of (batch_size, seq_length, dim), or If you have a hidden_state in the shape of (batch_size, dim) just unsqueeze(dim=1) it to become (batch_size,1,dim)

Then smoothly u can achieve the desired output by:

create an attention mask (batch_size, seq_len)
create decoder_input_ids (batch_size, 1)

tracing these two parameters, leads to two functions below:

attention mask you are already familiar with it.

github.com

huggingface/transformers/blob/d5ff69fce92bb1aab9273d674e762a8eddcb2e3f/src/transformers/generation_utils.py#L394-L403

    
      
          def _prepare_attention_mask_for_generation(
              self, input_ids: torch.Tensor, pad_token_id: int, eos_token_id: int
          ) -> torch.LongTensor:
              is_pad_token_in_inputs_ids = (pad_token_id is not None) and (pad_token_id in input_ids)
              is_pad_token_not_equal_to_eos_token_id = (eos_token_id is None) or (
                  (eos_token_id is not None) and (pad_token_id != eos_token_id)
              )
              if is_pad_token_in_inputs_ids and is_pad_token_not_equal_to_eos_token_id:
                  return input_ids.ne(pad_token_id).long()
              return input_ids.new_ones(input_ids.shape, dtype=torch.long)

decoder_input_ids it’s ones too (multiplied by config.decoder_start_token_ids).

github.com

huggingface/transformers/blob/d5ff69fce92bb1aab9273d674e762a8eddcb2e3f/src/transformers/generation_utils.py#L419-L426

    
      
          def _prepare_decoder_input_ids_for_generation(
              self, input_ids: torch.LongTensor, decoder_start_token_id: int = None, bos_token_id: int = None
          ) -> torch.LongTensor:
              decoder_start_token_id = self._get_decoder_start_token_id(decoder_start_token_id, bos_token_id)
              decoder_input_ids = (
                  torch.ones((input_ids.shape[0], 1), dtype=torch.long, device=input_ids.device) * decoder_start_token_id
              )
              return decoder_input_ids

wrapping it all:

generator = BartForConditionalGeneration.from_pretrained('facebook/bart-base')
inputs_embeds = # a 3D tensor, [batch, seq_length, dim]
attention_mask = torch.ones(inputs_embeds.shape[:2], dtype=torch.long)
decoder_input_ids = torch.ones((inputs_embeds.shape[0], 1), dtype=torch.long)*generator.config.decoder_start_token_id
output_ids = generator.generate(attention_mask=attention_mask, decoder_input_ids=decoder_input_ids, inputs_embeds=inputs_embeds, max_length=100,num_beams=4)

StackSmasher · November 25, 2021, 10:47am

Hey thanks a lot for the reply and it works nice !
Slight clarification, would I need to set decoder_input_ids when I’m training the said model using inputs_embeds as well ?

anon3699016 · April 17, 2022, 4:59pm

For BART, you can use encoder_outputs, which you should get from the encoder part of BART model.

Topic		Replies	Views
Generate with inputs_embeds 🤗Transformers	0	283	April 11, 2023
How to use inputs_embeds in generate()? 🤗Transformers	5	5625	July 8, 2023
BART - Input format Intermediate	4	1785	December 13, 2023
Rewriting generate function for manual decoder input 🤗Transformers	7	3561	July 11, 2022
How to use the encoder_outputs embeding to generate a sentence through decoder 🤗Transformers	0	207	October 29, 2022

How to generate a sequence using inputs_embeds instead of input_ids?

Related topics