Effect of selecting both `do_sample` and `num_beams` in summarization pipeline

I am using a summarization pipeline to generate summaries using a fine-tuned model. The summarizer object is initialised as follows:

    from transformers import pipeline

    # model and tokenizer are the fine-tuned seq2seq model and its tokenizer, loaded beforehand
    summarizer = pipeline(
        "summarization",
        model=model,
        tokenizer=tokenizer,
        num_beams=5,
        do_sample=True,
        no_repeat_ngram_size=3,
        max_length=1024,
        device=0,
        batch_size=8,
    )

According to the documentation, setting `num_beams=5` means that at each decoding step the five highest-scoring partial sequences are kept and all other candidates are discarded, so five hypotheses are carried forward throughout generation. However, this seems to be at odds with `do_sample=True`, where new tokens are drawn at random from the model's output distribution (not uniformly at random, of course, but I don't know the details of the process). Could anyone explain clearly how `num_beams=5` and `do_sample=True` work together? No error is raised, so I assume this is a valid configuration.
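For what it's worth, here is a minimal sketch of how I imagine comparing the two modes side by side by calling `generate` directly (assuming `model` and `tokenizer` are the same objects passed to the pipeline above, and `text` is an arbitrary input document); I'm not sure my comments describe what actually happens internally, which is essentially my question:

    # Tokenize an input document (text is a placeholder for any article to summarize)
    inputs = tokenizer(text, return_tensors="pt", truncation=True).to(model.device)

    # Plain beam search: deterministically keep the 5 highest-scoring hypotheses at each step
    beam_ids = model.generate(**inputs, num_beams=5, do_sample=False, max_length=1024)

    # Beam search with do_sample=True: my understanding is that candidate continuations
    # are sampled from the model's distribution instead of always taking the top-scoring
    # ones, but this is exactly the part I'd like clarified
    sampled_ids = model.generate(**inputs, num_beams=5, do_sample=True, max_length=1024)

    print(tokenizer.decode(beam_ids[0], skip_special_tokens=True))
    print(tokenizer.decode(sampled_ids[0], skip_special_tokens=True))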

Many thanks
