Isn't summarization limited to 1024 input tokens rather useless?

I am trying to use summarization with the transformers pipeline, but I can't use an input longer than 1024 tokens, whatever the model, which is barely the length of a summary itself.

I thought the point of summarization was to extract information from a long text, not from a couple of lines that we can read in two minutes. Maybe I am missing something? Because a summary of a 1024-token text seems quite useless to me…
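For reference, a minimal sketch of what I'm running; the file name and generation lengths are just placeholders:

```python
from transformers import pipeline

# The default summarization pipeline loads a BART-based checkpoint
# (sshleifer/distilbart-cnn-12-6), whose encoder is capped at 1024 positions.
summarizer = pipeline("summarization")

with open("long_article.txt") as f:  # placeholder: any text well over 1024 tokens
    text = f.read()

# Warns "Token indices sequence length is longer than the specified maximum
# sequence length for this model (... > 1024)" and then fails on the long input.
summary = summarizer(text, max_length=150, min_length=40)
print(summary[0]["summary_text"])
```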

Look into the models meant for long sequences (BigBird, Longformer, LongT5); see the sketch after this list:

- google/bigbird-pegasus-large-bigpatent
- allenai/led-large-16384-arxiv
- pszemraj/long-t5-tglobal-base-16384-book-summary
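Something like this should work as a starting point; the checkpoint choice and generation lengths are illustrative, and `truncation=True` is just a safety net for inputs beyond even the 16k limit:

```python
from transformers import pipeline

# LongT5 checkpoint fine-tuned for book summarization; accepts up to 16384 tokens.
summarizer = pipeline(
    "summarization",
    model="pszemraj/long-t5-tglobal-base-16384-book-summary",
)

with open("long_document.txt") as f:  # placeholder input file
    long_text = f.read()

summary = summarizer(
    long_text,
    max_length=256,   # illustrative generation lengths, tune for your use case
    min_length=64,
    truncation=True,  # truncate anything beyond the model's 16384-token limit
)
print(summary[0]["summary_text"])
```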
