Hello, I used this code to train a BART model and generate summaries.
However, the summaries come out at only 200-350 characters in length.
Is there some way to increase that length?
I thought of the following options:
encoder_max_length = 256 # demo
decoder_max_length = 64
which are used here:
def batch_tokenize_preprocess(batch, tokenizer, max_source_length, max_target_length):
    source, target = batch["document"], batch["summary"]
    source_tokenized = tokenizer(
        source, padding="max_length", truncation=True, max_length=max_source_length
    )
    target_tokenized = tokenizer(
        target, padding="max_length", truncation=True, max_length=max_target_length
    )
    batch = {k: v for k, v in source_tokenized.items()}
    # Ignore padding in the loss
    batch["labels"] = [
        [-100 if token == tokenizer.pad_token_id else token for token in l]
        for l in target_tokenized["input_ids"]
    ]
    return batch
train_data = train_data_txt.map(
    lambda batch: batch_tokenize_preprocess(
        batch, tokenizer, encoder_max_length, decoder_max_length
    ),
    batched=True,
    remove_columns=train_data_txt.column_names,
)
Also, another parameter that could matter is the max_length argument of the model.generate() function.
def generate_summary(test_samples, model):
    inputs = tokenizer(
        test_samples["document"],
        padding="max_length",
        truncation=True,
        max_length=encoder_max_length,
        return_tensors="pt",
    )
    input_ids = inputs.input_ids.to(model.device)
    attention_mask = inputs.attention_mask.to(model.device)
    outputs = model.generate(input_ids, attention_mask=attention_mask)
    output_str = tokenizer.batch_decode(outputs, skip_special_tokens=True)
    return outputs, output_str
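Note that the generate() call above passes no length settings, so it falls back to the model config's defaults. A sketch of the generation keyword arguments that control output length (the specific values here are illustrative assumptions, not tuned settings; all lengths are counted in tokens, not characters):

generation_kwargs = dict(
    max_length=256,          # raise the cap on generated tokens
    min_length=64,           # force generation past a minimum length
    num_beams=4,             # beam search, commonly used for summarization
    length_penalty=1.5,      # > 1.0 nudges beam search toward longer outputs
    no_repeat_ngram_size=3,  # reduce repetition in the longer summaries
)
# outputs = model.generate(input_ids, attention_mask=attention_mask, **generation_kwargs)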
Which of these should I alter to increase the length of the summaries?