How can I use the Trainer of HuggingFace to fine-tune a model of about 1.47B parameters, using two servers (nodes) each with 2 GPUs of RTX 8000 48GB?
Thank you
How can I use the Trainer of HuggingFace to fine-tune a model of about 1.47B parameters, using two servers (nodes) each with 2 GPUs of RTX 8000 48GB?
Thank you