Why does the median cross entropy loss change when I change the random seed?

referring the docs I think the second classification head is randomly initialised, which could be the reason for this. pinging @patrickvonplaten .