Difference in return sequence for Phi3 model

@exploiter345 thanks for the detailed analysis. This seems to be related to this issue. The fix is already merged into main, so after updating transformers from source the code should work without setting "eos_token_id" explicitly in the generation_config:

!pip install --upgrade git+https://github.com/huggingface/transformers.git
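
After the install (and a runtime restart, if you are in a notebook), it can help to confirm that the source build is the one actually being imported. Below is a minimal sketch using only the standard library. The helper names `installed_version` and `looks_like_source_build` are made up for illustration, and the check assumes that builds from main carry a ".dev" marker in their version string, which is a convention rather than a guarantee.

```python
from importlib import metadata


def installed_version(package="transformers"):
    """Return the installed version string of `package`, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def looks_like_source_build(version):
    """Heuristic (assumption): installs from main report a version containing '.dev'."""
    return ".dev" in version


if __name__ == "__main__":
    version = installed_version()
    if version is None:
        print("transformers is not installed in this environment")
    elif looks_like_source_build(version):
        print(f"transformers {version}: looks like a build from main")
    else:
        print(f"transformers {version}: looks like a release build, the fix may be missing")
```

If this still reports a release build after the upgrade, the old version is most likely still loaded in the running session.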