Run_mlm_wwm.py learning_rate confusion

jbmaxwell · November 17, 2021, 6:02pm

I want to run (or resume) the run_mlm.py script with a higher learning rate, but setting it in the script arguments doesn’t seem to have any effect.

import os

# script, model, data, out_dir, and overwrite are defined elsewhere.
os.system(
    f"python {script} \
        --model_type {model} \
        --config_name './models/{model}/config.json' \
        --train_file './content/{data}/train.txt' \
        --validation_file './content/{data}/test.txt' \
        --learning_rate 6e-4 \
        --weight_decay 0.01 \
        --warmup_steps 6000 \
        --adam_beta1 0.9 \
        --adam_beta2 0.98 \
        --adam_epsilon 1e-6 \
        --tokenizer_name './tokenizer/{model}' \
        --output_dir './{out_dir}' \
        --do_train \
        --do_eval \
        --num_train_epochs 40 \
        --overwrite_output_dir {overwrite} \
        --ignore_data_skip"
)

After warm-up, the log shows the learning rate topping out at 1e-05. I assume that is a default from somewhere, but I’m not sure where:

{'loss': 3.9821, 'learning_rate': 1e-05, 'epoch': 0.09}
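For reference, here is a minimal sketch of how a linear warm-up schedule would scale the rate with the arguments above. It assumes the script uses a linear warm-up followed by linear decay, which I have not confirmed for this run. The function name and TOTAL_STEPS are hypothetical and only there for illustration.

```python
def linear_warmup_lr(step, peak_lr, warmup_steps, total_steps):
    """Learning rate at `step` under linear warm-up followed by linear decay."""
    if step < warmup_steps:
        # Warm-up phase: ramp linearly from 0 up to peak_lr.
        return peak_lr * step / max(1, warmup_steps)
    # Decay phase: ramp linearly from peak_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


PEAK_LR = 6e-4         # --learning_rate from the command above
WARMUP_STEPS = 6000    # --warmup_steps from the command above
TOTAL_STEPS = 100_000  # hypothetical; the real value depends on dataset size and batch size

for step in (100, 500, 3000, 6000):
    print(step, linear_warmup_lr(step, PEAK_LR, WARMUP_STEPS, TOTAL_STEPS))
```

Under these assumptions the rate would be about 1e-05 at step 100 and would only reach 6e-4 at step 6000, so it may be worth checking whether the logged step is still inside the warm-up window.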
