related: How to generate multiple text completions per prompt (like vLLM) using HuggingFace Transformers Pipeline without triggering an error?