PEGASUS extracting from input instead of abstractive summarization

Skylixia · June 16, 2021, 9:47am

Hello,

When using PEGASUS_large with the summarization pipeline, all the model does is extract sentences from the input. I am surprised by this: even without fine-tuning, the pretraining should not have taught the model to just copy full sentences. Any idea why this happens?

The code I am using is the following:

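from transformers import PegasusForConditionalGeneration, PegasusTokenizer, pipeline

model_name = 'google/pegasus-large'
model = PegasusForConditionalGeneration.from_pretrained(model_name)
tokenizer = PegasusTokenizer.from_pretrained(model_name)
summarizer = pipeline("summarization", model=model, tokenizer=tokenizer)
# `input` holds the article text to summarize (defined elsewhere; note that it shadows the Python builtin)
summary = summarizer(input, min_length=10, max_length=150, do_sample=True, no_repeat_ngram_size=3, num_beams=10)[0]['summary_text']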

Using the hosted inference API on the Hugging Face model page produces the same results (so it also only extracts).
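
To put a number on "only extracts", a rough check along these lines could be used. This is only a sketch: the helper names `ngrams` and `copied_ngram_fraction` are made up for illustration, the tokenization is a plain whitespace split, and the example strings are toy data rather than actual model output.

```python
def ngrams(tokens, n):
    """Return the set of n-grams (as tuples) found in a list of tokens."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def copied_ngram_fraction(source, summary, n=3):
    """Fraction of the summary's n-grams that also occur in the source.

    A value close to 1.0 suggests the summary is mostly copied from the input.
    Hypothetical helper: whitespace tokenization is an assumption, not what
    the PEGASUS tokenizer does.
    """
    source_ngrams = ngrams(source.lower().split(), n)
    summary_ngrams = ngrams(summary.lower().split(), n)
    if not summary_ngrams:
        # Summary shorter than n tokens: nothing to compare.
        return 0.0
    return len(summary_ngrams & source_ngrams) / len(summary_ngrams)


# Toy data for illustration only.
source = "The cat sat on the mat. The dog barked at the cat."
extractive = "The cat sat on the mat."
abstractive = "A dog barked while a cat rested."

print(copied_ngram_fraction(source, extractive))
print(copied_ngram_fraction(source, abstractive))
```

Passing the original article and the pipeline's `summary_text` to `copied_ngram_fraction` would give a rough sense of how much of the output is copied verbatim.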

@sshleifer
