Transformer's output as input to other model

omerarshad · January 21, 2021, 5:29am

Hello,

I want to create a model which generates text and the generated text is input to other model. So basically two models are trained together. How can i achieve this using hugging face?

Thanks

marcoabrate · January 22, 2021, 8:39am

Welcome to the forum, @omerarshad!

Nice questions, I had the same problem, too. In my opinion this is possible only if you have ground truth for the intermediate step and not only the final reference. What you might do is to train two models separately: the first one with the intermediate reference, and the last one with the final reference.

Schematically:

Input -> MODEL_1 -> Output_1
                        | compare (cross-entropy)
             Intermediate Reference

Intermediate Reference -> MODEL_2 -> Output_2
                                        | compare (cross-entropy)
                                  Final Reference

What do you think?

omerarshad · January 22, 2021, 10:19am

Yes, i have ground truth for both models, and we can train them separately. The only issue i am facing is how to do using Hugging face? The task is that first model writes answer of a question, and second model takes this answer as input and generates question

marcoabrate · January 22, 2021, 11:26am

So you have questions references and answer references (let’s call them Q_REF and A_REF). Then you first take the model that answers, let’s call it MODEL_A and you train it with A_REF. After that, you take MODEL_Q and you train it with Q_REF. Once you have the fine-tuned models you can just have a question as input and produce an answer with MODEL_A, after you can take that answer and you give it to MODEL_Q to have a new question, if I understand correctly.

You can check out the milion examples on how to train a model for Q&A with your own dataset. Or you can start reproducing results with the SQuAD one. Anyway, I never did it so I am not the best to help you on that one.

omerarshad · March 27, 2021, 6:33pm

how can this be achieved using hugging face? Remember these both models will be trained together

Topic		Replies	Views
Fine-tune model with CoT Intermediate	1	394	January 27, 2025
How to create a new Hugging face model by using already available hugging face models 🤗Transformers	2	152	May 1, 2024
Quick question about testing huggingface seq2seq example Beginners	0	504	June 28, 2021
Model with Multiple inputs to yield Multiple Outputs 🤗Transformers	0	507	July 25, 2023
Using custom models (not necessarily transformer based) with generate() and sampling Beginners	2	1220	March 1, 2022

Transformer's output as input to other model

Related topics