Shuffling and buffer size

Besides using the “shuffle” buffer, we also shuffle the shards (underlying data files; dataset.n_shards returns the number of them) for more randomness, which leads to a different result unless the seed is fixed.

1 Like