Use one-hot encoding as input for T5 and GPT


Is it possible to train the T5 model using one-hot encoded inputs and integer token ids as targets?

Something like: `loss = model(onehot, attn, targetInList, attn)`

Since it's a translation problem, the one-hot input would look like `[[0,0,0,1,…],[0,1,0,0,…],…]`.

Any help would be appreciated!

If that's not possible, is there another way to convert the one-hot input back to a normal list of integer ids without breaking the computation graph? torch.argmax is not differentiable.
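To make the question concrete, here is a minimal sketch of the kind of setup I mean (a tiny randomly initialised T5 with placeholder sizes, not my real model). Matrix-multiplying the one-hot rows by the embedding table and passing the result as `inputs_embeds` looks like one differentiable alternative to argmax, but I'm not sure it's the right approach:

```python
import torch
from transformers import T5Config, T5ForConditionalGeneration

# Tiny randomly initialised T5 -- sizes are placeholders for illustration
config = T5Config(vocab_size=32, d_model=16, d_ff=32, num_layers=2,
                  num_heads=2, d_kv=8, decoder_start_token_id=0)
model = T5ForConditionalGeneration(config)

batch, seq_len = 2, 5
ids = torch.randint(0, config.vocab_size, (batch, seq_len))

# One-hot version of the same ids: shape (batch, seq_len, vocab_size)
onehot = torch.nn.functional.one_hot(ids, config.vocab_size).float()

# Multiplying the one-hot matrix by the embedding table is differentiable
# (unlike torch.argmax) and yields exactly the embeddings the integer ids
# would have produced.
inputs_embeds = onehot @ model.get_input_embeddings().weight

labels = torch.randint(0, config.vocab_size, (batch, seq_len))
out = model(inputs_embeds=inputs_embeds, labels=labels)
out.loss.backward()  # gradients flow back through the one-hot path
```

Is passing `inputs_embeds` like this the intended way to feed non-integer inputs to T5?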