I’m trying to fine-tune a BART model with a custom loss. I need to generate text with BART, pass that text as part of the input to a second model, and then compute a loss and backpropagate through the whole system. The second model’s weights are frozen and its output is between 0 and 1.
In pseudocode, I want something like this:
```python
def compute_loss(self, model, inputs):
    first_output, first_loss = first_model.generate(inputs)
    text = decode(first_output)
    second_output, second_loss = second_model(text)
    loss = loss_function(second_output, targets)
    return loss
```
How can I do something like this, given that the decoding step is not differentiable?
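To make the obstacle concrete, here is a minimal sketch of where the gradient breaks, using a random logits tensor as a stand-in for the BART decoder (the shapes and names are illustrative, not BART’s actual API). Hard decoding, like `argmax` inside `generate()`, yields integer token ids, which carry no gradient back to the logits:

```python
import torch

# Toy stand-in for decoder output: (batch, seq_len, vocab_size) logits.
logits = torch.randn(1, 5, 10, requires_grad=True)

# Hard decoding (what generate() effectively does): pick the argmax token ids.
token_ids = logits.argmax(dim=-1)

# token_ids is an integer tensor, so autograd cannot flow through it.
print(token_ids.requires_grad)  # False

# By contrast, soft probabilities would keep the graph intact --
# but the second model expects text, not probability vectors.
probs = torch.softmax(logits, dim=-1)
print(probs.requires_grad)  # True
```

So the gradient chain is cut exactly at the `decode` step in the pseudocode above.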