Repetitions after pre-training T5X

Hello, I am pre-training T5_1_1 with the t5x pretraining script on a large corpus of text for translation into Japanese. After training, I tried to translate a simple “Hello”, but the model repeats the Japanese “Hello” several times, output as escaped Unicode sequences. The number of repetitions matches the task feature lengths I have defined.

  1. Is there a setting I can tweak to reduce repetitions, similar to the repetition controls in CTranslate2?
  2. In the preprocessors for my training task, I append the EOS token automatically as follows:
    preprocessors=[
        seqio.preprocessors.tokenize,
        seqio.preprocessors.append_eos_after_trim,
    ],
  3. Any other tips on how to reduce repetitions?
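
For context on question 1: in t5x, decoding behavior is usually configured through gin overrides rather than runtime flags. Below is a sketch of the kind of override I have been experimenting with; the exact bindings (e.g. `decode_fn`, `decoding.beam_search.alpha`) are assumptions based on typical t5x infer/eval configs and should be checked against the t5x version in use:

```gin
# Sketch: gin overrides to adjust decoding (names assumed from
# common t5x configs; verify against your installed t5x version).
from t5x import decoding
from t5x import models

# Use beam search as the decode function for the model.
models.EncoderDecoderModel.decode_fn = @decoding.beam_search

# Length-penalty coefficient for beam search; shorter outputs are
# favored less as alpha grows.
decoding.beam_search.alpha = 0.6
```

I have not found a direct equivalent of CTranslate2-style repetition penalties exposed here, which is part of what I am asking about.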