Train Bart for Conditional Generation (e.g. Summarization)

niksqwerty · September 16, 2021, 4:57am

lvwerra:

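from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir='./models/bart-summarizer',  # where checkpoints are written
    num_train_epochs=1,                     # total number of training epochs
    per_device_train_batch_size=1,          # batch size per device during training
    per_device_eval_batch_size=1,           # batch size per device during evaluation
    warmup_steps=500,                       # warmup steps for the learning-rate scheduler
    weight_decay=0.01,                      # strength of weight decay
    logging_dir='./logs',                   # directory for logs
)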

As per the documentation, the labels field is only for “Labels for computing the masked language modeling loss.” If I don’t specify ‘labels’, no loss value is returned. How do I compute the loss if I want to train only for text summarization?

I am sending the paragraph as encoder_input_ids and the summary as decoder_input_ids, and I am also adding an attention mask (attention_mask_target) for both.
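
For context, here is a rough sketch of what I understand happens when the summary token ids are passed as labels: the decoder inputs are the labels shifted one position to the right, and the loss is ordinary token-level cross-entropy that skips positions labelled -100. This is a plain-Python illustration, not the library code. The helper names, the toy token ids, and the pad/start ids (1 and 2, which mirror BART's usual defaults) are assumptions for the example.

```python
import math

IGNORE_INDEX = -100  # label value skipped by the loss (assumed, as in PyTorch cross-entropy)

def shift_tokens_right(labels, pad_token_id, decoder_start_token_id):
    """Build decoder inputs from labels: prepend the start token, drop the last label."""
    shifted = [decoder_start_token_id] + labels[:-1]
    # Ignored label positions become ordinary padding on the input side
    return [pad_token_id if t == IGNORE_INDEX else t for t in shifted]

def cross_entropy(logits, labels):
    """Mean negative log-likelihood over positions whose label is not IGNORE_INDEX."""
    losses = []
    for row, target in zip(logits, labels):
        if target == IGNORE_INDEX:
            continue  # padded position, contributes nothing to the loss
        log_norm = math.log(sum(math.exp(x) for x in row))
        losses.append(log_norm - row[target])
    return sum(losses) / len(losses)

# Toy summary of three tokens plus one padded position (ids are made up)
labels = [5, 6, 2, IGNORE_INDEX]
decoder_input_ids = shift_tokens_right(labels, pad_token_id=1, decoder_start_token_id=2)

# Uniform logits over a 10-token vocabulary stand in for the decoder output
logits = [[0.0] * 10 for _ in labels]
loss = cross_entropy(logits, labels)
```

With the real model, the equivalent should be calling model(input_ids=..., attention_mask=..., labels=...) on a BartForConditionalGeneration instance and reading the loss attribute of the returned output. The Trainer picks that loss up when each batch contains a labels key, so decoder_input_ids would not need to be passed separately.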
