Generating decoder input ids during inference for opus-mt

HarshitaDiddee · August 25, 2021, 6:28pm

Hello,

Goal: I am trying to run inference on the TensorFlow Lite variant of the opus-mt (en-hi) model.

Question: Why does the model need decoder_input_ids during inference, and why, if I have understood the assignment correctly, are they set to the shifted input ids? Given that during training I supply the target sequences through this parameter, won't this create an inconsistency between how the model is trained and how it is used at inference?

Context: During inference, the interpreter requires three inputs: the input_ids, the decoder_input_ids and the attention mask, all of which I have to provide on the edge side. From my understanding, HF’s generate() function already handles two of these when they are not passed, i.e. the decoder_input_ids and the attention mask. In my case, I have to supply both myself. I looked through this to get an idea of how they are generated and inferred them to be:

decoder_input_ids set to the input_ids shifted right, with the pad_token_id in the first position (as specified in HF’s MarianMT documentation)
attention_mask set to a default all-ones mask (naively numpy.ones((batch_len, max_seq_length)))
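
A minimal sketch of the two assignments described above, written on plain Python lists so it stays dependency-free (convert with numpy.asarray before feeding an interpreter). The shift follows the same logic as the shift_tokens_right helper in HF's Marian modeling code, where the decoder start token is the pad token. The token ids, including PAD_TOKEN_ID = 9, are hypothetical values chosen for illustration and are not taken from the opus-mt vocabulary.

```python
# Hypothetical pad id for illustration only; read the real one from the model config.
PAD_TOKEN_ID = 9

def shift_tokens_right(batch, pad_token_id, decoder_start_token_id):
    """Shift every row one position to the right and prepend the start token."""
    shifted = []
    for row in batch:
        # Drop the last token and put the decoder start token in front.
        new_row = [decoder_start_token_id] + list(row[:-1])
        # -100 is the label-ignore index; replace it so only valid ids remain.
        new_row = [pad_token_id if tok == -100 else tok for tok in new_row]
        shifted.append(new_row)
    return shifted

def ones_attention_mask(batch_size, max_seq_length):
    """Naive mask that attends to every position, padding included."""
    return [[1] * max_seq_length for _ in range(batch_size)]

input_ids = [[11, 12, 13, 2]]  # hypothetical token ids
decoder_input_ids = shift_tokens_right(input_ids, PAD_TOKEN_ID, PAD_TOKEN_ID)
attention_mask = ones_attention_mask(len(input_ids), len(input_ids[0]))
```

An all-ones mask also attends to padded positions, so a mask that is 0 wherever input_ids equals the pad id may be the safer choice for padded batches.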

Note: I have been able to generate an output with the tflite interpreter using these assignments, so this isn’t a syntactic issue.
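
For comparison, generate() does not start from shifted inputs on an encoder-decoder model: it begins with only the decoder start token and appends one predicted token per step. A rough sketch of that greedy loop follows. Here step_fn is a hypothetical stand-in for one interpreter call that returns next-token scores, and toy_step is a fake model used only to make the sketch runnable. A fixed-shape TFLite model would presumably also need decoder_input_ids padded to max_seq_length at each step, which this sketch leaves out.

```python
def greedy_decode(step_fn, input_ids, start_token_id, eos_token_id, max_len):
    """Grow decoder_input_ids one token at a time, starting from the start token."""
    decoder_input_ids = [start_token_id]
    for _ in range(max_len - 1):
        # step_fn is assumed to return one score per vocabulary entry
        # for the position that follows the current decoder_input_ids.
        scores = step_fn(input_ids, decoder_input_ids)
        next_id = max(range(len(scores)), key=scores.__getitem__)
        decoder_input_ids.append(next_id)
        if next_id == eos_token_id:
            break
    # Drop the start token; the rest is the generated sequence.
    return decoder_input_ids[1:]

def toy_step(input_ids, decoder_input_ids):
    """Fake model over a 5-token vocabulary: emits 2, then 3, then EOS (0)."""
    script = [2, 3, 0]
    target = script[min(len(decoder_input_ids) - 1, len(script) - 1)]
    return [1.0 if tok == target else 0.0 for tok in range(5)]

generated = greedy_decode(toy_step, [1, 2], start_token_id=4, eos_token_id=0, max_len=10)
```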

Thanks in advance for any help!
