Hi,
It seems that in a single-node, multi-GPU environment, all GPUs are used automatically. Which distributed strategy is used in this case: DP or DDP? I couldn’t find any documentation describing this behavior.
Thanks