BART summarization: strategies to improve entity preservation

rubenk · November 3, 2021, 9:46am

I’m currently working on a model that should perform extractive summarization, flatten the output and make it conform to a style guide. BART for conditional generation has delivered very good results so far. The main issue the model has seems to be the tokenization of entities with strange names, mainly companies. For instance, Boeing confuses the model because it looks like Boe + ing.

I see two main options:

Unsupervised fine-tuning of the BERT-style encoder on the kind of text that needs to be summarized. See also this post.
Performing a NER preprocessing step, which could place markers around the entities (Boeing to ) so that the model can see that these entities never change.

I’m not sure how 1. would be done, technically speaking.

On the other hand 2. seems viable, has anyone done something similar in the past? What would be a smart way to wrap the entities?

If someone has a better idea, I’d be very grateful to hear that, too!

Topic		Replies	Views
Train Bart for Conditional Generation (e.g. Summarization) Models	14	17159	November 22, 2023
BART with custom encoder and decoder Models	5	921	May 25, 2023
BART fill-mask and generate summaries Models	0	321	October 11, 2021
BART Paraphrasing Beginners	6	3079	February 18, 2022
[Beginner] fine-tune Bart with custom dataset in other language? Beginners	2	3232	January 22, 2021

BART summarization: strategies to improve entity preservation

Related topics