Looking some more, it turns out that `val_max_target_length` is used in `generate` and overrides `model.config.max_length`, as you can see here:

So there is actually a working solution, now that we know which of the 4 args is used to override `max_length`.

I double-checked that this is so with:
```diff
diff --git a/examples/seq2seq/seq2seq_trainer.py b/examples/seq2seq/seq2seq_trainer.py
index 32a96555..7d8f4741 100644
--- a/examples/seq2seq/seq2seq_trainer.py
+++ b/examples/seq2seq/seq2seq_trainer.py
@@ -216,6 +216,10 @@ class Seq2SeqTrainer(Trainer):
             "num_beams": self.data_args.eval_beams if self.data_args is not None else self.config.num_beams,
         }
+        logger.info(f"***** generate args *****")
+        for k, v in sorted(gen_kwargs.items()):
+            logger.info(f"  {k} = {v}")
+
         if self.args.predict_with_generate and not self.args.prediction_loss_only:
             generated_tokens = self.model.generate(
                 inputs["input_ids"],
```
So I'm getting:

```
2020-12-18 16:21:38 | INFO | seq2seq_trainer | ***** generate args *****
2020-12-18 16:21:38 | INFO | seq2seq_trainer |   max_length = 50
2020-12-18 16:21:38 | INFO | seq2seq_trainer |   num_beams = 4
```
So the overriding is indeed happening. But why it is done only with `self.data_args.val_max_target_length`, I don't know.
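To make the mechanism concrete, here is a standalone sketch (not the actual `transformers` code; `SimpleConfig` and `ToyModel` are made-up names) of the pattern at work: an explicit kwarg passed to `generate` wins, and the model config only supplies the fallback default.

```python
# Minimal mimic of the kwarg-over-config precedence seen above.
# These classes are hypothetical stand-ins, not transformers internals.

class SimpleConfig:
    def __init__(self, max_length=20, num_beams=1):
        self.max_length = max_length
        self.num_beams = num_beams

class ToyModel:
    def __init__(self, config):
        self.config = config

    def generate(self, input_ids, **kwargs):
        # Explicit kwargs take precedence; config values are the fallback.
        max_length = kwargs.get("max_length", self.config.max_length)
        num_beams = kwargs.get("num_beams", self.config.num_beams)
        return max_length, num_beams

model = ToyModel(SimpleConfig(max_length=20, num_beams=1))
print(model.generate([1, 2, 3]))                              # (20, 1) - config used
print(model.generate([1, 2, 3], max_length=50, num_beams=4))  # (50, 4) - kwargs win
```

This is why the logged `max_length = 50` above silently takes effect regardless of what `model.config.max_length` says.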
So there are 2 possible things to do here:

- either add explicit `--min_gen_length` and `--max_gen_length` args and pass those into `generate`, or at the very least document that `--val_max_target_length` has a double usage - one for validation dataset truncation, and a secondary use for overriding `generate`'s `max_length`.
- Perhaps that comment about "use task specific params" should be amended to say that further overrides may happen, since the info logger doesn't report that `model.config.max_length` was effectively set to `self.data_args.val_max_target_length`, and thus it is confusing to the user.
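The first item could look roughly like this sketch, assuming dedicated generation flags (`--min_gen_length` / `--max_gen_length` do not exist in the actual script; the defaults here are illustrative):

```python
# Hypothetical sketch of separate generation-length flags, so that
# --val_max_target_length no longer does double duty.
import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--val_max_target_length", type=int, default=142,
                    help="Truncation length for validation targets only (illustrative default).")
parser.add_argument("--min_gen_length", type=int, default=None,
                    help="Explicit min_length to pass to generate().")
parser.add_argument("--max_gen_length", type=int, default=None,
                    help="Explicit max_length to pass to generate().")
args = parser.parse_args(["--val_max_target_length", "56", "--max_gen_length", "50"])

# Only forward the flags the user set, so model.config values still
# apply as defaults when a flag is left unset.
gen_kwargs = {k: v for k, v in {
    "min_length": args.min_gen_length,
    "max_length": args.max_gen_length,
}.items() if v is not None}
print(gen_kwargs)  # {'max_length': 50}
```

With this split, dataset truncation and generation length are controlled independently, and nothing is overridden behind the user's back.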
I submitted a PR that addresses these 2 items above.

So this flurry of comments cleared up which command-line arg to use to override `max_length`, but I doubt it made any difference to your problem.
If the problem is still unresolved, please help us reproduce it. Ideally use the existing summarization datasets that we use for testing, as explained here:
- cnn_dm https://github.com/huggingface/transformers/blob/master/examples/seq2seq/README.md#cnndailymail
- xsum https://github.com/huggingface/transformers/blob/master/examples/seq2seq/README.md#xsum
or, if that doesn't work, please make a small sample that reproduces the problem with your data, along with copy-n-paste instructions to get and deploy it. Thank you!