How to use T5 for sentence embedding?

Is there any way to use the encoder part of the T5 model for representation learning?

Hi @banucool
You can initialize the T5Model class and run a forward pass through its encoder only. The last_hidden_state of the returned output holds the final hidden states.

from transformers import T5Model, T5Tokenizer

model = T5Model.from_pretrained("t5-small")
tok = T5Tokenizer.from_pretrained("t5-small")

enc = tok("some text", return_tensors="pt")

# forward pass through the encoder only
output = model.encoder(
    input_ids=enc["input_ids"],
    attention_mask=enc["attention_mask"],
    return_dict=True,
)
# get the final hidden states
emb = output.last_hidden_state

The shape of emb will be (batch_size, seq_len, hidden_size).
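
If you want one fixed-size vector per sentence rather than per-token states, a common option is to mean-pool the hidden states over the attention mask. A minimal sketch, reusing enc and emb from the snippet above:

# average the token states, ignoring padding positions
mask = enc["attention_mask"].unsqueeze(-1)      # (batch_size, seq_len, 1)
summed = (emb * mask).sum(dim=1)                # (batch_size, hidden_size)
counts = mask.sum(dim=1).clamp(min=1)           # (batch_size, 1)
sentence_emb = summed / counts                  # (batch_size, hidden_size)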


Thanks a lot @valhalla :blush:

Can we use a pruned version of BERT for feature extraction? Does that make sense?

To clarify, the code above just returns the final hidden state of each token, not a whole-sentence embedding.
For sentence embeddings you can try Sentence-BERT:
https://huggingface.co/sentence-transformers
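
For illustration, a minimal sketch using the sentence-transformers library; the checkpoint name here is just one example of the pretrained models it provides:

from sentence_transformers import SentenceTransformer

# any pretrained sentence-embedding checkpoint works; this one is an example
model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode(["some text", "another sentence"])
print(embeddings.shape)  # (2, embedding_dim) as a numpy array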


Hi, I’m interested in using T5 to generate word embeddings. I tried the code supplied above but unfortunately got this error message:

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-40-5f6e22d1ad1e> in <module>()
      1 model = T5Model.from_pretrained("t5-small")
----> 2 tok = T5Tokenizer.from_pretrained("t5-small")
      3 
      4 enc = tok("some text", return_tensors="pt")
      5 

TypeError: 'NoneType' object is not callable

Do you have any thoughts on how to resolve this error?

Thank you in advance for your help. :slightly_smiling_face:
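
For anyone hitting the same error: this TypeError usually means T5Tokenizer failed to import and resolved to None, most often because the sentencepiece dependency is missing. A minimal sketch of the usual fix, assuming that is the cause:

# T5Tokenizer needs the sentencepiece package; without it, transformers
# exposes the class as None, and calling it raises
# "'NoneType' object is not callable".
# Install the dependency (and restart the runtime), e.g.:
#   pip install sentencepiece

from transformers import T5Tokenizer

tok = T5Tokenizer.from_pretrained("t5-small")
enc = tok("some text", return_tensors="pt")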