It starts training on multiple GPUs if they are available. You can control which GPUs are used with the CUDA_VISIBLE_DEVICES
environment variable, e.g. if CUDA_VISIBLE_DEVICES=1,2
then it'll use only CUDA devices 1 and 2 (inside the process they show up as cuda:0 and cuda:1). Pinging @sgugger for more info.
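
Here is a rough sketch of setting the variable from inside a Python script rather than from the shell. It assumes PyTorch is the backend; `visible_gpu_ids` is a hypothetical helper written just for this illustration, not part of any library. The variable has to be set before CUDA is initialised, so it goes at the very top of the script.

```python
import os

# Set this before anything initialises CUDA (safest: before importing torch).
os.environ["CUDA_VISIBLE_DEVICES"] = "1,2"


def visible_gpu_ids():
    """Hypothetical helper: list the physical GPU indices named in CUDA_VISIBLE_DEVICES."""
    raw = os.environ.get("CUDA_VISIBLE_DEVICES", "")
    return [part.strip() for part in raw.split(",") if part.strip()]


print("Physical GPUs requested:", visible_gpu_ids())

try:
    import torch

    # The visible devices are renumbered from 0 inside the process,
    # so physical GPUs 1 and 2 are addressed as cuda:0 and cuda:1.
    print("GPUs visible to torch:", torch.cuda.device_count())
except ImportError:
    # Assumption: torch may not be installed where this sketch is run.
    print("torch not installed, skipping the device count check")
```

The same thing from the shell would be something like `CUDA_VISIBLE_DEVICES=1,2 python train.py`, where `train.py` stands in for whatever your training script is called.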