Can an EncoderModel be trained on top of a concatenation of BertModel [CLS] embeddings with additional input data using the transformers library?

niquet · December 9, 2022, 11:29am

Hello dear HuggingFace community,

I am currently trying to develop a BertForSequenceClassification class that takes two inputs: the [CLS] vector from the BertModel output and additional data (about 50 dimensions). I concatenate the extra data with the [CLS] embedding and now need to train an encoder on top of it to learn non-linear relationships in the combined input. The goal is to use this model to classify different text blocks on web pages, using both the text of a block and its tag path, represented as a sparse vector.

Is it possible to provide the EncoderModels with such an input concatenation? If it is, could anyone provide me with a snippet or link on how to implement and train it?

(Edit: Alternatively, if this is not possible with the EncoderModels provided by the HuggingFace transformers library, how can I feed such an input to a BiLSTM followed by a classification head?)

Thank you very much in advance!
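
For anyone landing here with the same question, here is a minimal sketch of one way the setup described above could look in plain PyTorch on top of `BertModel`. It is not an official transformers class: `BertWithExtraFeatures`, its `hidden` and `dropout` parameters, and the small MLP used as the non-linear layer are assumptions made for illustration. The tiny randomly initialized `BertConfig` is only there so the snippet runs without downloading weights; in practice you would presumably load a pretrained checkpoint with `BertModel.from_pretrained(...)` instead.

```python
import torch
import torch.nn as nn
from transformers import BertConfig, BertModel


class BertWithExtraFeatures(nn.Module):
    """Hypothetical wrapper: [CLS] embedding + extra features -> MLP -> logits."""

    def __init__(self, bert, num_extra, num_labels, hidden=256, dropout=0.1):
        super().__init__()
        self.bert = bert
        in_dim = bert.config.hidden_size + num_extra
        # A small MLP stands in for the "encoder" that learns
        # non-linear interactions between text and extra features.
        self.head = nn.Sequential(
            nn.Linear(in_dim, hidden),
            nn.ReLU(),
            nn.Dropout(dropout),
            nn.Linear(hidden, num_labels),
        )

    def forward(self, input_ids, attention_mask, extra, labels=None):
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        cls = out.last_hidden_state[:, 0]          # [CLS] token vector
        combined = torch.cat([cls, extra], dim=-1)  # (batch, hidden_size + num_extra)
        logits = self.head(combined)
        loss = None
        if labels is not None:
            loss = nn.functional.cross_entropy(logits, labels)
        return loss, logits


# Tiny random BERT so the sketch runs offline (assumption, not a real checkpoint).
config = BertConfig(
    vocab_size=100,
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
)
model = BertWithExtraFeatures(BertModel(config), num_extra=50, num_labels=4)

# Dummy batch: 8 text blocks of 16 tokens, each with a 50-dim tag-path vector.
input_ids = torch.randint(0, 100, (8, 16))
attention_mask = torch.ones_like(input_ids)
extra = torch.rand(8, 50)
labels = torch.randint(0, 4, (8,))

loss, logits = model(input_ids, attention_mask, extra, labels)
loss.backward()  # gradients flow into both the head and BERT
```

If the tag-path vector is stored in a sparse format, it would need to be converted to a dense float tensor before the `torch.cat` call. Whether to fine-tune BERT together with the head or freeze it and train only the head is a design choice this sketch leaves open.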
