How to use T5 for sentence embedding?

Hi @banucool
You can initialize the T5Model class and do a forward pass through its encoder only. With return_dict=True, the returned output object's last_hidden_state attribute holds the final hidden states (it is also the first element if you use the tuple output).

from transformers import T5Model, T5Tokenizer

model = T5Model.from_pretrained("t5-small")
tok = T5Tokenizer.from_pretrained("t5-small")

enc = tok("some text", return_tensors="pt")

# forward pass through encoder only
output = model.encoder(
    input_ids=enc["input_ids"], 
    attention_mask=enc["attention_mask"], 
    return_dict=True
)
# get the final hidden states
emb = output.last_hidden_state

The shape of emb will be (batch_size, seq_len, hidden_size), i.e. one vector per token rather than one per sentence.
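If you want a single fixed-size vector per sentence, a common convention (not something T5 itself prescribes) is masked mean pooling over the token embeddings, using the attention mask so padding tokens are ignored. A minimal sketch:

```python
import torch
from transformers import T5Model, T5Tokenizer

model = T5Model.from_pretrained("t5-small")
tok = T5Tokenizer.from_pretrained("t5-small")

# batch of two sentences, padded to the same length
enc = tok(["some text", "another sentence"], return_tensors="pt", padding=True)

with torch.no_grad():
    out = model.encoder(
        input_ids=enc["input_ids"],
        attention_mask=enc["attention_mask"],
        return_dict=True,
    )

# mask: (batch, seq_len, 1) — 1 for real tokens, 0 for padding
mask = enc["attention_mask"].unsqueeze(-1).float()
# sum token vectors, then divide by the number of real tokens
summed = (out.last_hidden_state * mask).sum(dim=1)
counts = mask.sum(dim=1).clamp(min=1e-9)
sentence_emb = summed / counts  # (batch_size, hidden_size)
```

For t5-small the hidden size is 512, so sentence_emb here has shape (2, 512).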
