I was fine-tuning RoBERTa on a multi-label classification problem 10 times and keeping track of each evaluation F1 score. However, even when randomizing the selection of training and testing data, as well as seed numbers (i.e. transformers.set_seed, numpy, tf, tf.keras, random), I find that towards t…

Repeating eval-F1 scores with seed + data randomization

John6666 July 1, 2025, 9:00pm 3

I wonder if the model output has reached the ideal value…
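One rough way to test that idea is to rule out two simpler explanations first: the "random" splits being the same on every run, and the model emitting the same label vector for every sample. The sketch below uses only the standard library. The helper names (`split_indices`, `micro_f1`, `is_constant_prediction`) are made up for illustration and are not part of transformers or any other library, and micro-averaged F1 is an assumption, since the post does not say which averaging was used.

```python
import random


def split_indices(n_samples, test_fraction, seed):
    """Shuffle indices with a run-specific seed and return (train, test)."""
    rng = random.Random(seed)  # local RNG, independent of global seeding
    indices = list(range(n_samples))
    rng.shuffle(indices)
    n_test = int(n_samples * test_fraction)
    return indices[n_test:], indices[:n_test]


def micro_f1(y_true, y_pred):
    """Micro-averaged F1 over all (sample, label) pairs of 0/1 indicators."""
    tp = fp = fn = 0
    for true_row, pred_row in zip(y_true, y_pred):
        for t, p in zip(true_row, pred_row):
            tp += int(t == 1 and p == 1)
            fp += int(t == 0 and p == 1)
            fn += int(t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def is_constant_prediction(y_pred):
    """True if the same label vector was predicted for every sample."""
    return len({tuple(row) for row in y_pred}) <= 1


if __name__ == "__main__":
    # Hypothetical run configuration: 100 samples, 20% held out, three seeds.
    test_sets = {tuple(sorted(split_indices(100, 0.2, s)[1])) for s in (0, 1, 2)}
    print("distinct test sets:", len(test_sets))

    # Toy predictions standing in for one run's thresholded model output.
    y_true = [[1, 0], [0, 1]]
    y_pred = [[1, 0], [1, 1]]
    print("micro F1:", micro_f1(y_true, y_pred))
    print("constant predictions:", is_constant_prediction(y_pred))
```

If the test sets differ between runs and the predictions are not constant, yet the F1 scores still repeat, then a plateau in model quality becomes the more plausible explanation.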
