Text generation, text2text: change output vocabulary, output distribution dimensions

Kawaka · March 11, 2021, 1:49pm

Hi,

I’d like to know if it’s possible to modify the output head in order to output (generate) only words that are present in the given context or in a given vocabulary/vocabulary size ? And how would you proceed ?

I’m currently trying with T5ForConditionalGeneration but i think that could be applied to any encoder/decoder text generating model.

I was thinking of last linear layer with output_size equals to max input size and iterate on the decoder to generate words distribution until end token. I’m a bit confuse on how to do so… if you have any resources that could help i would really appreciate your help. Thanks !

Topic		Replies	Views
Keyword generation using T5 Models	4	1990	November 2, 2022
Custom langage modeling/generate words from context Beginners	0	241	March 12, 2021
T5 Model, T5 Encoder Model and T5 Model for Conditional Generation Beginners	1	1298	November 20, 2022
How to generate text with T5Model other than T5ForConditionalGeneration? 🤗Transformers	0	299	September 22, 2022
Fine-tune MT5ConditionalGeneration for question generation Intermediate	0	487	January 4, 2022

Text generation, text2text: change output vocabulary, output distribution dimensions

Related topics