How to separately use T5 decoder

KKSAVETHEWORLD · January 10, 2022, 10:24am

I am working on a task in which I need to modify the encoder output before it is decoded.
What I would like to do is roughly this:

input_ids = tokenizer("i am trying hard!", return_tensors="pt").input_ids
last_hidden_state = model.encoder(input_ids=input_ids).last_hidden_state
modified_last_hidden_state = modify(last_hidden_state)
outputs = model.decoder(modified_last_hidden_state)
output_sequence = tokenizer.decode(outputs)

However, I think calling model.decoder() like this doesn't actually do what I want.

KKSAVETHEWORLD · January 10, 2022, 11:31am

Replying to myself:
I think this is a good attempt, since the loss and hidden states are exactly the same as in the standard training process. I will test the training process later.
The separate process:
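
The snippet that followed this post is not preserved in the thread. As a stand-in (not the poster's code), here is a minimal sketch of one way to check the claim that a separate encoder and decoder pass gives the same logits and loss as the standard forward pass. It assumes a tiny randomly initialised T5 built from T5Config so that nothing has to be downloaded, random ids in place of real tokenizer output, and id 0 as the decoder start token.

```python
import torch
import torch.nn.functional as F
from transformers import T5Config, T5ForConditionalGeneration

# Assumption: a tiny random T5 so the sketch runs without downloading weights.
# For a real model, T5ForConditionalGeneration.from_pretrained("t5-small") could be used.
torch.manual_seed(0)
config = T5Config(vocab_size=128, d_model=32, d_kv=8, d_ff=64, num_layers=2, num_heads=4)
model = T5ForConditionalGeneration(config).eval()  # eval() turns dropout off

input_ids = torch.randint(2, 128, (1, 6))  # stand-in for tokenizer(...).input_ids
labels = torch.randint(2, 128, (1, 5))     # stand-in for the tokenized target text

# Teacher forcing: the decoder input is the target shifted right by one,
# starting with id 0 (T5 uses the pad token, id 0, as decoder start token).
start = torch.zeros((labels.size(0), 1), dtype=torch.long)
decoder_input_ids = torch.cat([start, labels[:, :-1]], dim=1)

with torch.no_grad():
    # Standard path: encoder and decoder are run inside the model.
    full = model(input_ids=input_ids, decoder_input_ids=decoder_input_ids, labels=labels)

    # Separate path, step 1: encoder only. The output could be modified here.
    enc_hidden = model.encoder(input_ids=input_ids).last_hidden_state

    # Step 2: the decoder takes its own input ids plus the encoder states
    # and returns hidden states, not token ids.
    dec_hidden = model.decoder(
        input_ids=decoder_input_ids,
        encoder_hidden_states=enc_hidden,
        use_cache=False,
    ).last_hidden_state

    # Step 3: with tied embeddings, T5 rescales before the LM head.
    if config.tie_word_embeddings:
        dec_hidden = dec_hidden * (config.d_model ** -0.5)
    logits = model.lm_head(dec_hidden)
    loss = F.cross_entropy(
        logits.reshape(-1, logits.size(-1)), labels.reshape(-1), ignore_index=-100
    )

logits_match = torch.allclose(logits, full.logits, atol=1e-5)
loss_match = torch.allclose(loss, full.loss, atol=1e-5)
print(logits_match, loss_match)
```

If both checks pass, inserting a modification between step 1 and step 2 changes only the encoder states the decoder attends to; the rest of the computation is the same as in the standard forward pass.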

nvishi7 · April 5, 2023, 10:30am

Have you found anything on this?
I also want to use the encoder and decoder separately.
My task involves passing the tokenized input ids to the encoder to get the last_hidden_state, passing those embeddings to the decoder to get tokens, and then decoding those tokens back into text.
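
The pipeline described above (encoder, then decoder, then tokens) might be sketched as a hand-written greedy loop. This is only an illustration under stated assumptions: the model is a tiny randomly initialised T5 so nothing is downloaded, random ids stand in for tokenizer output, and `modify` and `greedy_decode` are hypothetical helpers, not part of the transformers API.

```python
import torch
from transformers import T5Config, T5ForConditionalGeneration

# Assumption: tiny random T5; a pretrained checkpoint would be used in practice.
torch.manual_seed(0)
config = T5Config(vocab_size=128, d_model=32, d_kv=8, d_ff=64, num_layers=2, num_heads=4)
model = T5ForConditionalGeneration(config).eval()


def modify(hidden):
    # Hypothetical placeholder for whatever edit is applied to the encoder output.
    return hidden + 0.01 * torch.randn_like(hidden)


def greedy_decode(model, encoder_hidden_states, max_new_tokens=8, start_id=0, eos_id=1):
    # Hypothetical helper: decodes one token at a time from the given encoder states.
    batch = encoder_hidden_states.size(0)
    decoder_input_ids = torch.full((batch, 1), start_id, dtype=torch.long)
    # T5 rescales decoder output before the LM head when embeddings are tied.
    scale = model.config.d_model ** -0.5 if model.config.tie_word_embeddings else 1.0
    with torch.no_grad():
        for _ in range(max_new_tokens):
            hidden = model.decoder(
                input_ids=decoder_input_ids,
                encoder_hidden_states=encoder_hidden_states,
                use_cache=False,  # recompute every step; simple but not efficient
            ).last_hidden_state
            next_logits = model.lm_head(hidden[:, -1] * scale)
            next_id = next_logits.argmax(dim=-1, keepdim=True)
            decoder_input_ids = torch.cat([decoder_input_ids, next_id], dim=1)
            if (next_id == eos_id).all():
                break
    return decoder_input_ids[:, 1:]  # drop the start token


input_ids = torch.randint(2, 128, (1, 6))  # stand-in for tokenizer(...).input_ids
with torch.no_grad():
    enc_hidden = model.encoder(input_ids=input_ids).last_hidden_state
generated = greedy_decode(model, modify(enc_hidden))
print(generated.shape)
```

With a real checkpoint and its tokenizer, the resulting ids could be turned into text with `tokenizer.batch_decode(generated, skip_special_tokens=True)`. As far as I know, `model.generate` can also take precomputed encoder states through its `encoder_outputs` argument, which would replace the manual loop, but that path is not shown here.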

UserDAN · December 23, 2023, 7:47am

Thank you for sharing.

Any update on this? I mean, does it work like the standard way of fine-tuning?

daidv1112 · July 7, 2024, 5:40am

Hi, is there any update on this?
