Multiple texts as inputs to Transformers models

I would like to use multiple texts as inputs to a model. Let's say I have a dataset with 10 columns, where each column is a text (a sentence or two). How can I feed all these inputs to the model and do classification, for example?
I can see it's possible to just concatenate all the texts into one, but it seems that I would need a very large dataset to be able to achieve good accuracy.
Maybe use multiple models (BERT) in parallel, take their last hidden states, concatenate them, and classify? But the problem is that this produces a huge number of values, since I have on the order of 30 texts.

Any idea how to tackle this ?

You should take the same approach as extractive text summarization:

Concatenate all your sentences, separated by a special token ([CLS] for example), then use the [CLS] token representation to do classification.
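A minimal sketch of this layout, using a small randomly initialized encoder in place of a pretrained BERT (the token IDs and model dimensions here are made up for illustration; a real tokenizer assigns its own special-token IDs):

```python
import torch
import torch.nn as nn

# Hypothetical special-token IDs (BERT's tokenizer uses its own).
CLS, SEP = 101, 102

def build_input(sentence_ids):
    """Concatenate tokenized sentences as [CLS] sen1 [SEP] [CLS] sen2 [SEP] ..."""
    ids = []
    for sen in sentence_ids:
        ids += [CLS] + sen + [SEP]
    return torch.tensor([ids])  # batch of 1

class ConcatClassifier(nn.Module):
    def __init__(self, vocab_size=30522, d_model=64, n_classes=3):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, n_classes)

    def forward(self, ids):
        h = self.encoder(self.embed(ids))  # (batch, seq_len, d_model)
        cls_repr = h[:, 0]                 # hidden state at the first [CLS]
        return self.head(cls_repr)         # (batch, n_classes)

ids = build_input([[7, 8, 9], [10, 11], [12]])  # three dummy "sentences"
logits = ConcatClassifier()(ids)
```

With a real model you would swap the toy encoder for a pretrained `BertModel` and take the same position-0 hidden state.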

From the PreSumm paper


Hi @astariul, thank you for your reply.
I understand what you suggested. The problem is that I don't only have texts as inputs, I also have some float values. Would converting these values to text be sufficient?

I see…

I never encountered this case myself, but maybe you can feed the float values directly into the last classifier?
Since they are not text, there is no need for BERT to encode them (?)
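One hedged way to wire that up is a sketch like the following, where the text encoder itself is left out and only the final classifier is shown (names and dimensions are assumptions, not a fixed recipe):

```python
import torch
import torch.nn as nn

class TextPlusFloatsClassifier(nn.Module):
    """Concatenate the encoder's [CLS] representation with raw float
    features right before the final classification layer."""
    def __init__(self, d_text=64, n_floats=4, n_classes=2):
        super().__init__()
        self.head = nn.Linear(d_text + n_floats, n_classes)

    def forward(self, cls_repr, float_feats):
        # cls_repr:    (batch, d_text), e.g. BERT's [CLS] hidden state
        # float_feats: (batch, n_floats), numeric columns fed in directly
        return self.head(torch.cat([cls_repr, float_feats], dim=-1))

model = TextPlusFloatsClassifier()
logits = model(torch.randn(8, 64), torch.randn(8, 4))  # (batch of 8)
```

In practice it may help to normalize the float features first, since the text representation and raw numbers can be on very different scales.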

Hi @Zack

I don't know whether you've tried / considered the Multimodal Toolkit (blog post, GitHub). It takes in tabular data (text, numbers, categorical data) and can use it as input to develop models.

Haven’t tried it myself, but looks quite promising.


Hello @astariul ,

Thank you for your answer. I am also trying to do something similar, but I had a question: even after concatenating the data, will the different inputs be given their own weights and biases in this case, or will the whole input text be given a single set of weights and biases?

You can do both.

In the case of BERT for summarization, they just use sentence-specific representations:
[CLS] sen1 [SEP] [CLS] sen2 [SEP] [CLS] sen3 [SEP]
Then use each CLS token as a representation of each sentence.

But if you want a general representation for the whole text, you can just train your model with an additional token at the beginning:
[CLS2] [CLS] sen1 [SEP] [CLS] sen2 [SEP] [CLS] sen3 [SEP]

Then use each [CLS] token for its sentence's representation, and [CLS2] for the whole-text representation.
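A small sketch of the two input layouts (the token IDs here are hypothetical; a real tokenizer would assign its own, and [CLS2] would have to be added to the vocabulary and embedding table):

```python
# Hypothetical IDs: 101/102 mimic BERT's [CLS]/[SEP]; 30000 is a new
# [CLS2] token appended to the vocabulary.
CLS, SEP, CLS2 = 101, 102, 30000

def build_input(sentence_ids, whole_text_token=False):
    """[CLS] sen1 [SEP] [CLS] sen2 [SEP] ..., optionally prefixed by [CLS2]."""
    ids = []
    for sen in sentence_ids:
        ids += [CLS] + sen + [SEP]
    if whole_text_token:
        ids = [CLS2] + ids
    return ids

def cls_positions(ids):
    """Indices whose hidden states serve as per-sentence representations."""
    return [i for i, t in enumerate(ids) if t == CLS]

ids = build_input([[7, 8], [9]], whole_text_token=True)
# position 0 holds [CLS2] (whole text); cls_positions(ids) gives sentences
```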

Hello @astariul, thank you for your reply again. I was just curious to know whether I can build a model using Hugging Face Transformers that gives me multiple outputs?

Yes you can.
Regular models have a single output head (usually an LM head or a classification head) on top of the Transformer stack, but you can just add several different heads on top of the same stack to get multiple outputs.
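As a sketch, assuming a pooled representation coming out of the Transformer body (the body here is a stand-in; in practice it would be e.g. a pretrained `BertModel`, with one loss per head summed during training):

```python
import torch
import torch.nn as nn

class MultiHeadModel(nn.Module):
    """One shared body, several task-specific heads."""
    def __init__(self, d_model=64, n_classes_a=3, n_classes_b=5):
        super().__init__()
        # Stand-in for the Transformer stack's pooling layer.
        self.body = nn.Sequential(nn.Linear(d_model, d_model), nn.Tanh())
        self.head_a = nn.Linear(d_model, n_classes_a)  # e.g. topic
        self.head_b = nn.Linear(d_model, n_classes_b)  # e.g. sentiment

    def forward(self, pooled):
        h = self.body(pooled)
        return self.head_a(h), self.head_b(h)  # one logits tensor per task

logits_a, logits_b = MultiHeadModel()(torch.randn(2, 64))  # batch of 2
```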