Fine-tuning BART on a multi-input sequence-to-sequence task

I fine-tuned bart-base on a sequence-to-sequence task and have the following questions:

a) Currently I structure the input and output for the BART model in "T5 style" by adding prefixes in front of each piece of input (a rough sketch of my formatting is below). For BART, how should I give multiple inputs to the model (or train it to return multiple outputs)? Is there a special token to separate inputs, should I keep using the T5-style prefixes, etc.? Also, how would I do this for GPT-2/GPT-Neo?
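
For reference, this is roughly what my current preprocessing looks like. The field names (`context`, `question`, `answer`) and the length limits are just placeholders for my actual data, not anything BART-specific:

```python
from transformers import BartTokenizer

tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")

def build_example(context, question, answer):
    # T5-style prefixes glued into a single source string
    source = f"context: {context} question: {question}"
    target = f"output: {answer}"

    model_inputs = tokenizer(source, truncation=True, max_length=512)
    labels = tokenizer(target, truncation=True, max_length=128)

    # labels are the token ids of the target sequence
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs
```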

b) When fine-tuned with prefixes, the target data is formatted as "output: …"; however, the fine-tuned BART returns "outputoutput: …". Why is this repetition occurring? Also, does the BART tokenizer automatically add the eos token?
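
This is how I've been trying to check the eos behaviour myself (not sure it's the right way to verify it):

```python
from transformers import BartTokenizer

tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")

ids = tokenizer("output: some answer").input_ids
print(tokenizer.convert_ids_to_tokens(ids))
# Is the last id the eos token, i.e. does the tokenizer append </s> for me?
print(ids[-1] == tokenizer.eos_token_id)
```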

c) Does the Trainer API automatically handle adjust_logits_during_generation and decoder_start_token_id, as discussed in this post?
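
For context, this is all I've done so far: load the model and look at the config values. I haven't set anything manually, so I'm unsure whether the Trainer/generation code takes care of these for me:

```python
from transformers import BartForConditionalGeneration

model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")

# For the facebook/bart-base checkpoint this seems to be the eos id (</s>),
# if I'm reading the config correctly.
print(model.config.decoder_start_token_id)
print(model.config.bos_token_id, model.config.eos_token_id)
```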

@valhalla Could you please help with this? This is my first project training an NLP model, and I would really appreciate any information you can offer regarding my questions.
