Is Sequence to Sequence the best model for paragraph classification?

If I wanted to summarize/classify an entire paragraph into a single output token, could an encoder+decoder model be fine-tuned to produce that behavior? For example, I want to classify an email as ‘Work_Related’.

Hi @moyi-druzi, this should be possible. During training, you feed the emails into the encoder and provide the output token(s) as the labels for the decoder. For example, you train a T5 decoder to generate something like “Work-related” or “Not work-related” and then immediately stop (i.e., emit the end-of-sequence token).
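
A rough sketch of this generate-the-label approach, assuming T5 via the Transformers library (the checkpoint, email text, and label strings below are illustrative, not from the original post):

```python
import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

email = "Hi team, please send me the Q3 budget before Friday's meeting."

# Training: the decoder's target text is simply the label string.
inputs = tokenizer(email, return_tensors="pt", truncation=True)
labels = tokenizer("Work-related", return_tensors="pt").input_ids
loss = model(**inputs, labels=labels).loss  # backprop this during fine-tuning

# Inference: generate a few tokens and decode them back into the label.
with torch.no_grad():
    pred_ids = model.generate(**inputs, max_new_tokens=5)
print(tokenizer.decode(pred_ids[0], skip_special_tokens=True))
```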

You can also use an encoder-decoder model for sequence classification by doing something like:

  • Feed the email to both the encoder and the decoder
  • In the decoder, feed the hidden state of the final EOS token into a classification layer
  • The classification layer predicts the class (work-related, not work-related)

BartForSequenceClassification is an example of this.
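
Here is a minimal sketch of that in practice, assuming an illustrative checkpoint (“facebook/bart-base”); note the classification head is randomly initialized until you fine-tune it on your labeled emails:

```python
import torch
from transformers import BartForSequenceClassification, BartTokenizer

tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")
# num_labels=2 attaches a fresh (randomly initialized) classification head.
model = BartForSequenceClassification.from_pretrained(
    "facebook/bart-base", num_labels=2
)
model.config.id2label = {0: "Not work-related", 1: "Work-related"}

email = "Don't forget the client call at 3pm."
inputs = tokenizer(email, return_tensors="pt", truncation=True)

# Internally, the decoder hidden state at the final EOS token is passed
# through the classification head to produce the logits.
with torch.no_grad():
    logits = model(**inputs).logits
print(model.config.id2label[logits.argmax(dim=-1).item()])
```

Pooling at the final EOS position works because the decoder's attention is causal, so that last position is the only one that has attended to the entire sequence.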

As for the question in the title: no, probably not. Encoder-only models (e.g., BERT or RoBERTa) are far more common for sequence classification problems, though it is certainly possible with encoder-decoder models.
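
For comparison, the encoder-only equivalent, using AutoModelForSequenceClassification with an assumed BERT checkpoint (again, the head needs fine-tuning on your data before the predictions mean anything):

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

inputs = tokenizer("Lunch on Saturday?", return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits
print(logits.argmax(dim=-1).item())  # label mapping is defined at fine-tuning time
```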

