Gradient accumulation: should I duplicate data?

Hm, good question. Tagging @patrickvonplaten who created that notebook.