Hi @sgugger,
Is there any special parameter that needs to be passed to the Trainer class for it to work with multiple GPUs?
Please have a look at my thread "Not able to scale Trainer code to single node multi GPU" in the Transformers category of the Hugging Face Forums.