Optimal ratio for eval_steps when fine-tuning

I want to fine-tune the GPT-2 model on my own dataset, with a total of 500,000 training steps and a learning rate of 5e-4. I set eval_steps to 5,000, which means the model will be evaluated every 1% of the total training steps. I know that the optimal eval_steps frequency depends on the specific dataset and other hyperparameters, but given my limited computational power, I am looking for a reasonable starting point so I can fine-tune the model effectively. I'd be grateful for any advice.
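For reference, here is a minimal sketch of how I'm configuring this, assuming the Hugging Face `Trainer` API (the dataset/model setup is omitted, and the exact argument names may differ slightly between `transformers` versions):

```python
from transformers import TrainingArguments

# Sketch of the training configuration described above:
# 500,000 total steps, lr 5e-4, evaluation every 5,000 steps (1% of training).
training_args = TrainingArguments(
    output_dir="./gpt2-finetuned",      # hypothetical output path
    max_steps=500_000,                  # total training steps
    learning_rate=5e-4,
    evaluation_strategy="steps",        # evaluate on a step interval, not per epoch
    eval_steps=5_000,                   # 5,000 / 500,000 = 1% of total steps
    save_steps=5_000,                   # checkpoint at the same cadence (optional)
    logging_steps=1_000,
)
```

My thinking is that tying `save_steps` to `eval_steps` keeps checkpoints aligned with evaluation results, but I'm not sure whether 1% is too coarse or too fine an interval for a run this long.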