Train BART for Conditional Generation (e.g. Summarization)

According to the documentation, the ‘labels’ field is only for “Labels for computing the masked language modeling loss.” But if I don’t specify ‘labels’, the model doesn’t return a loss value. How do I compute the loss if I only want to train for text summarization?

I am passing the paragraph as encoder_input_ids and the summary as decoder_input_ids, and I am also passing attention_mask_target for both.
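
To make the question concrete, here is a minimal pure-Python sketch of what a summarization loss usually amounts to under teacher forcing: the summary token ids act as the labels, the decoder inputs are those labels shifted one position to the right, and the loss is the token-level cross-entropy between the decoder's logits and the labels. The helper names, the toy token ids, and the uniform logits are assumptions made for illustration, not code taken from the library. The value -100 follows PyTorch's default ignore_index for positions (such as padding) that should not count toward the loss.

```python
import math

IGNORE_INDEX = -100  # label value skipped by the loss (PyTorch's default ignore_index)


def shift_tokens_right(labels, pad_token_id, decoder_start_token_id):
    """Build teacher-forcing decoder inputs: prepend the start token, drop the last label."""
    shifted = [decoder_start_token_id] + labels[:-1]
    # Ignored label positions must be real token ids before they are fed to the decoder.
    return [pad_token_id if t == IGNORE_INDEX else t for t in shifted]


def cross_entropy(logits, labels):
    """Mean negative log-likelihood over positions whose label is not IGNORE_INDEX."""
    total, count = 0.0, 0
    for row, label in zip(logits, labels):
        if label == IGNORE_INDEX:
            continue
        # Numerically stable log-sum-exp for the softmax normaliser.
        m = max(row)
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_z - row[label]
        count += 1
    if count == 0:
        raise ValueError("no label positions to score")
    return total / count


# Toy example with hypothetical ids: pad = 1, decoder start = 2, vocabulary of 8 tokens.
PAD, START, VOCAB_SIZE = 1, 2, 8
summary_labels = [5, 6, 3, IGNORE_INDEX]  # last position is padding, so it is ignored
decoder_input_ids = shift_tokens_right(summary_labels, PAD, START)

# Stand-in for the decoder output: one row of logits per target position.
uniform_logits = [[0.0] * VOCAB_SIZE for _ in summary_labels]
loss = cross_entropy(uniform_logits, summary_labels)
print(decoder_input_ids, loss)
```

In the Hugging Face implementation, passing the summary token ids as labels (and leaving decoder_input_ids unset) appears to trigger this same shift-and-score computation, which would make the “masked language modeling” wording in the docstring a naming leftover rather than a restriction on the task.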