Set_seed and training argument's data_seed

If I specify the set_seed in the trainer utility, does it also fix the data_seed in TrainingArguments?

To ensure reproducibility, is using set_seed alone sufficient, or should I also define data_seed?

data_seed (int, optional):
Random seed to be used with data samplers. If not set, random generators for data sampling will use the
same seed as seed. This can be used to ensure reproducibility of data sampling, independent of the model
seed.

1 Like

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.