Pipeline vs model.generate()

nielsr · November 17, 2022, 8:01am

Hi,

The pipeline() API is created mostly for people who don’t care too much about the details of the underlying process, for people who just want to use a machine learning model without having to implement several details like pre- and postprocessing themselves. The pipeline API is created such that you get an easy-to-use abstraction over any ML model, which is great for inference. The SummarizationPipeline for instance uses generate() behind the scenes.

On the other hand, if you do care about the details, then it’s recommended to generate text yourself by calling generate() yourself and implement pre-and postprocessing yourself.

Also note that any text generation pipeline does provide a generate_kwargs argument, which means that technically you can forward any of the keyword arguments that generate() supports to the pipeline as well.

Topic		Replies	Views
Difference between pipeline and model.generate? 🤗Transformers	2	2581	February 26, 2024
How to generate multiple text completions per prompt (like vLLM) using HuggingFace Transformers Pipeline without triggering an error? Beginners	4	2722	May 12, 2024
Different Summary Outputs Locally vs API for the Same Text Amazon SageMaker	7	2060	December 6, 2021
Difference between model.generate() and model() outputs Intermediate	2	2970	March 3, 2024
Is model.generate slower than model forward call? 🤗Transformers	1	182	August 18, 2024

Pipeline vs model.generate()

Related topics