Hi,
I’d like to train a large model on 8 gpus. But the training function is called by another python code. Is there a function to launch the distributed training from within another process?
Any advice is welcome
Thanks!
Hi,
I’d like to train a large model on 8 gpus. But the training function is called by another python code. Is there a function to launch the distributed training from within another process?
Any advice is welcome
Thanks!