How to combine TF-IDF weights with transformers?

Hi, I’m trying to implement the approach suggested by this paper. In their words:

“we apply the TF-IDF score in the BERT mask layer, making the different attention score for the embedding crossing”
and
“through the attention mechanism of BERT model, we converted the distance between two words at any position to 1, which effectively solves the difficult long-term dependence problem in NLP. So we can directly use the feature representation of BERT as the word embedding feature of the following task.”
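My best guess at the first quote is that each token’s TF-IDF score is added (perhaps as a log) as a bias to the attention logits before the softmax, roughly

$$\operatorname{Attention}(Q, K, V) = \operatorname{softmax}\!\left(\frac{QK^{\top}}{\sqrt{d_k}} + \log \operatorname{tfidf}_j\right) V,$$

where $\operatorname{tfidf}_j$ is the score of key token $j$, but the paper never spells this out, so I may well be misreading it.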

I can’t figure out how to implement this. Pretty much every example I can find of doing arithmetic on model outputs operates on the hidden states or the pooled output, and I’m not sure whether TF-IDF weighting there makes any sense.
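For reference, here is a minimal sketch of the one interpretation I’ve managed to make concrete so far: weighting BERT’s final hidden states by each token’s TF-IDF score before mean pooling. The model name, the toy corpus, and the fallback weight of 1.0 for special tokens are all my own placeholders, and this is probably not what the paper means by “the mask layer”:

```python
import torch
from sklearn.feature_extraction.text import TfidfVectorizer
from transformers import AutoModel, AutoTokenizer

corpus = ["the cat sat on the mat", "dogs chase cats in the park"]  # toy corpus
text = corpus[0]

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

# Fit TF-IDF on the same wordpiece tokens BERT uses, so every sub-token gets a score.
vectorizer = TfidfVectorizer(tokenizer=tokenizer.tokenize, lowercase=False)
vectorizer.fit(corpus)
tfidf_lookup = dict(zip(vectorizer.get_feature_names_out(),
                        vectorizer.transform([text]).toarray()[0]))

enc = tokenizer(text, return_tensors="pt")
tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])

# Per-token weights; special tokens ([CLS], [SEP]) and unseen tokens fall back to 1.0.
weights = torch.tensor([[tfidf_lookup.get(tok, 1.0) for tok in tokens]])

with torch.no_grad():
    hidden = model(**enc).last_hidden_state  # (1, seq_len, hidden_dim)

# TF-IDF-weighted mean pooling over the sequence.
weighted = hidden * weights.unsqueeze(-1)
sentence_embedding = weighted.sum(dim=1) / weights.sum(dim=1, keepdim=True)
print(sentence_embedding.shape)  # torch.Size([1, 768])
```

This only touches the output side, though; if anyone knows how to push the weights into the attention computation itself, I’d love to see it.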

I’m also interested in this question. Any luck with this, @ebrky?