Please read the topic category description to understand what this is all about
Description
Most of the available Transformer models for text summarization only support English documents. At the same time, there are now many pretrained BERT-like models in non-English languages. The goal of this project is to explore whether the [EncoderDecoder architecture](https://huggingface.co/transformers/model_doc/encoderdecoder.html?highlight=encoderdecoder#encoder-decoder-models) in Transformers can be used to create summarization models using just the pretrained weights of encoder-based models.
Your task is to pick a pretrained encoder in a non-English language and train it to summarize texts in that language.
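As a starting point, warm-starting an encoder-decoder model from an encoder-only checkpoint takes only a few lines with the `EncoderDecoderModel` class. The sketch below uses the Spanish BETO checkpoint `dccuchile/bert-base-spanish-wwm-cased` purely as an example; any BERT-like encoder in your chosen language should work.

```python
from transformers import AutoTokenizer, EncoderDecoderModel

# Example checkpoint only: substitute any BERT-like encoder in your target language.
checkpoint = "dccuchile/bert-base-spanish-wwm-cased"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)

# Initialize both the encoder and the decoder from the same pretrained encoder weights;
# the decoder's cross-attention layers are randomly initialized and learned during fine-tuning.
model = EncoderDecoderModel.from_encoder_decoder_pretrained(checkpoint, checkpoint)

# Special tokens the model needs for generation.
model.config.decoder_start_token_id = tokenizer.cls_token_id
model.config.eos_token_id = tokenizer.sep_token_id
model.config.pad_token_id = tokenizer.pad_token_id
```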
Model(s)
See the Hub for example models that people have fine-tuned using this architecture. Your task is to create your very own model with this technique!
Datasets
Search for summarization datasets on the Hub to get an appropriate corpus for this task.
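As one illustration (the dataset choice here is an assumption, not part of the project brief), the MLSUM corpus on the Hub covers Spanish, German, French, Russian, and Turkish news articles with reference summaries:

```python
from datasets import load_dataset

# Load the Spanish split of MLSUM; swap the config name for your language.
dataset = load_dataset("mlsum", "es")

sample = dataset["train"][0]
print(sample["text"][:200])  # article body
print(sample["summary"])     # reference summary
```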
Challenges
Text summarization is a tricky NLP task, so the performance obtained with these models may not match what is observed for their English counterparts (where much more data is available).
Desired project outcomes
- Create a Streamlit or Gradio app on Spaces that can summarize a document in your chosen language (a minimal Gradio sketch follows this list)
- Don’t forget to push all your models and datasets to the Hub so others can build on them!
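For the app outcome above, a minimal Gradio sketch could look like the following. The model id is a placeholder for whatever checkpoint you push to the Hub.

```python
import gradio as gr
from transformers import pipeline

# Placeholder model id: replace with your fine-tuned checkpoint on the Hub.
summarizer = pipeline("summarization", model="<your-username>/bert2bert-summarization")

def summarize(text: str) -> str:
    # Truncate long inputs to the encoder's maximum length and return the generated summary.
    return summarizer(text, truncation=True)[0]["summary_text"]

demo = gr.Interface(
    fn=summarize,
    inputs=gr.Textbox(lines=10, label="Document"),
    outputs=gr.Textbox(label="Summary"),
)
demo.launch()
```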
Additional resources
- Leveraging Pre-trained Checkpoints for Sequence Generation Tasks [PAPER]
- Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models [BLOG POST]
- Examples of these models on the Hub by @mrm8488: https://twitter.com/mrm8488/status/1458475725565141001?s=20
Discord channel
To chat and organize with other people interested in this project, head over to our Discord and:
- Follow the instructions on the #join-course channel
- Then join one of the following channels:
  - #encoder-decoder-es (Spanish)
Just make sure you comment here to indicate that you’ll be contributing to this project