Hey @lqtrung The position_ids don’t need to be passed, as long as the right attention_mask is. prepare_inputs_for_generation (see here) takes care of that for you
position_ids
attention_mask
prepare_inputs_for_generation