Doubt on Tokenization in Pegasus

Hi, I created a 16-2 Pegasus student with make_student.py, then tried to fine-tune it on the XSUM dataset with finetune.py. The command I ran is:

```
python finetune.py \
    --max_source_length 500 \
    --data_dir xsum \
    --freeze_encoder --freeze_embeds \
    --learning_rate=1e-4 \
    --do_train --do_predict \
    --val_check_interval 0.1 --n_val 1000 \
    --max_target_length=60 --val_max_target_length=60 --test_max_target_length=100 \
    --model_name_or_path dpx_xsum_16_2 \
    --train_batch_size=1 --eval_batch_size=1 \
    --sortish_sampler \
    --num_train_epochs=6 \
    --warmup_steps 500 \
    --output_dir distilpeg_xsum_sft_16_2 \
    --gpus 0 \
    --gradient_accumulation_steps 256 \
    --adafactor \
    --dropout 0.1 --attention_dropout 0.1 \
    --overwrite_output_dir
```

My question: is it normal that, if I don't specify --max_source_length 500, I get an error in the embedding layer? And if I leave --max_source_length set like that, is the fine-tuning still effective?
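In case it helps, this is roughly how I reproduce/avoid the error outside the trainer. It is just a minimal sketch: `dpx_xsum_16_2` is my local student directory, the article string is a placeholder, and I'm assuming the student keeps the teacher's max_position_embeddings in its config.

```python
from transformers import PegasusForConditionalGeneration, PegasusTokenizer

student_dir = "dpx_xsum_16_2"  # student created with make_student.py
tokenizer = PegasusTokenizer.from_pretrained(student_dir)
model = PegasusForConditionalGeneration.from_pretrained(student_dir)

# The positional embedding table is what limits the source length; tokens
# beyond it raise an index error inside the embedding layer.
max_len = model.config.max_position_embeddings
print(max_len)

# Truncating the source to that length avoids the error.
batch = tokenizer(
    ["a long XSUM article goes here ..."],  # placeholder text
    truncation=True,
    max_length=max_len,
    return_tensors="pt",
)
summary_ids = model.generate(**batch, max_length=60)
print(tokenizer.batch_decode(summary_ids, skip_special_tokens=True))
```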

Thanks in advance!

I noticed that the script in the repo sets --max_source_length 512, so I reran with that setting. But I see that the starting ROUGE-2 score in metrics.json is 0.0. Is this a problem?
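For reference, this is the toy check I did with the rouge_score package to convince myself that a 0.0 ROUGE-2 just means there is no bigram overlap between predictions and references yet (the strings are made up):

```python
from rouge_score import rouge_scorer

scorer = rouge_scorer.RougeScorer(["rouge2"], use_stemmer=True)

# No shared bigrams -> ROUGE-2 is exactly 0.0, which is plausible for a
# barely trained 16-2 student at the first validation step.
print(scorer.score("the cat sat on the mat", "a completely different sentence"))

# Identical strings -> ROUGE-2 is 1.0, as a sanity check.
print(scorer.score("the cat sat on the mat", "the cat sat on the mat"))
```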