Because seed is a TrainingArgument, we can use it in hyperparameter optimization if we want to, e.g.
```python
def hparams_ray(trial):
    from ray import tune
    return {
        "learning_rate": tune.loguniform(1e-6, 1e-3),
        "per_device_train_batch_size": tune.choice([4, 8, 16, 32]),
        "seed": tune.choice(range(1, 43)),
    }
```
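To make the concern below concrete, here is a minimal, library-free sketch (plain Python with the `random` module, not the actual Trainer internals) contrasting the two behaviours: resetting a fixed seed before every trial versus letting the RNG state carry over from one trial to the next. The function names are purely illustrative.

```python
import random

def run_trial(learning_rate, seed=None):
    # If a seed is given, reset the RNG before the trial; otherwise the
    # RNG state simply carries over from whatever ran before.
    if seed is not None:
        random.seed(seed)
    # Stand-in for the stochastic parts of training: weight init,
    # data shuffling, dropout.
    return random.random()

# Without resetting: two trials with identical hyperparameters see
# different RNG states, so their results differ for reasons unrelated
# to the hyperparameters.
a = run_trial(1e-4)
b = run_trial(1e-4)
assert a != b

# With resetting: identical hyperparameters give identical results, so
# any difference between trials is attributable to the hyperparameters.
c = run_trial(1e-4, seed=42)
d = run_trial(1e-4, seed=42)
assert c == d
```

If the real Trainer does not reset the seed at the start of each trial, the first situation applies and trial outcomes confound hyperparameter effects with RNG-state effects.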
But I am looking for clarification on what happens when we do not include seed here. Is the seed (or any training argument, for that matter) reset at each trial? This is important: if it is not reset, then every trial starts from a different seed, and the trials of the hyperparameter search are not independent comparisons.