Batch generation with GPT2

Hey @lqtrung :wave: You don't need to pass position_ids, as long as you pass the correct attention_mask. prepare_inputs_for_generation (see here) derives the position_ids from the mask for you :smiley:
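
To make that concrete, here is a minimal plain-Python sketch of the idea: position ids are the running count of real tokens minus one, and padded slots get a placeholder value since they are masked out anyway. The helper name `position_ids_from_mask` and the `pad_position` parameter are made up for illustration and are not part of transformers; the library does the equivalent on tensors.

```python
# Sketch only: a plain-Python illustration of deriving position ids from an
# attention mask. `position_ids_from_mask` is a hypothetical helper, not a
# transformers API.

def position_ids_from_mask(attention_mask, pad_position=1):
    """Return one list of position ids per row of a 0/1 attention mask."""
    position_ids = []
    for row in attention_mask:
        running = 0  # number of real (unmasked) tokens seen so far in this row
        ids = []
        for m in row:
            running += m
            # Real tokens are numbered 0, 1, 2, ...; padded slots get a
            # placeholder because attention ignores them anyway.
            ids.append(running - 1 if m == 1 else pad_position)
        position_ids.append(ids)
    return position_ids


# Two prompts, left-padded to length 5 (0 = padding, 1 = real token).
attention_mask = [
    [0, 0, 1, 1, 1],
    [1, 1, 1, 1, 1],
]
position_ids = position_ids_from_mask(attention_mask)
print(position_ids)  # [[1, 1, 0, 1, 2], [0, 1, 2, 3, 4]]
```

So for batched generation the important part is a correct mask: pad on the left (for example `tokenizer.padding_side = "left"`), and pass the tokenizer's attention_mask to generate along with the input_ids.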