In BertSUM paper they say that the summarization happens after the output of bert, they use a stack of specific summarization layers ( two layers of transformers worked best) then add a softmax layer to get which sentence should be included into the summmary. Now in their code there are classes for clustering, what’s the use of these classes/ methods? https://pypi.org/project/bert-extractive-summarizer/
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Transfer Learning an Extractive Summarization Bert Model such as BertSUM? | 0 | 1711 | September 27, 2021 | |
Clustering news articles with sentence bert | 15 | 19993 | October 29, 2023 | |
Anyone have advice on best methods to cluster BERT-embedded documents? | 2 | 2532 | August 31, 2021 | |
Extracting the output of hidden BERT layers and re-training the BERT model on custom datasets | 0 | 810 | March 17, 2021 | |
Current best practice for final linear classifier layer(s)? | 3 | 2422 | September 12, 2020 |