Transformers and Hyperparameter search using Optuna

Ok yes, this clarifies the issue. The dataset I had would result in 1 epoch=500 steps, and because the default number of steps to log when logging_strategy="steps" was 500, I thought that this logs anyways every 500 steps. Thank you very much for this!

Yes, you understood correctly, thank you for the clarifications on this too.

Also, for the trainer.state.log_history, just to underline that it contains only the log of the last trial. So, as you stated, I understand there is not an obvious way to get the logs for the wholes study.

Thank you very much once again for the help!