Pipeline max_length

harshit-ranjan · February 23, 2024, 9:20am

What does the max_length argument in the pipeline function do?

    pipe = pipeline('text2text-generation', model=self.model, tokenizer=self.tokenizer)
    output = []

    for ele in dataset:
        pred = pipe(ele, max_length=1024)
        output.append({'input' : ele ,'output': pred[0]['generated_text']})

dgunzy · February 23, 2024, 1:09pm

I think thats the max length in tokens the model can accept, could be wrong though.

RaushanTurganbay · February 23, 2024, 8:30pm

Hi! The max_length here controls for maximum tokens that can be generated. The generation stops when we reach the maximum. Note that the model might generate incomplete sentences, if you specify max_length too short, by default it is 20 tokens.

Topic		Replies	Views
How to set 'max_length' properly when using pipeline? 🤗Transformers	4	1601	November 18, 2024
Limit max # of tokens for inference in pipeline? Beginners	0	1080	April 7, 2023
Issue with max_length 🤗Transformers	1	2467	September 27, 2020
Getting error even after setting the max_length Beginners	1	2060	November 30, 2023
How does the pipeline deal with too long sequences? Beginners	3	89	January 17, 2025

Pipeline max_length

Related topics