How to train my model on multiple GPUs

I am trying to train the Sentence Transformers model cross-encoder/ms-marco-MiniLM-L-12-v2, but when I train it, it only uses one GPU even though my machine has two. I tried DataParallel and DistributedDataParallel, but neither worked.

from sentence_transformers import SentenceTransformer, losses
from torch.utils.data import DataLoader

# Replace 'model_name' and 'max_seq_length' with your actual model name and max sequence length
model_name = 'your_model_name'
max_seq_length = your_max_seq_length

# Load SentenceTransformer model
model = SentenceTransformer(model_name)
model.max_seq_length = max_seq_length

# Replace 'train_examples' with your actual training examples
train_examples = your_train_examples

# Create DataLoader for training
train_dataloader = DataLoader(train_examples, batch_size=16, drop_last=True, shuffle=True)

# Define the loss function
train_loss = losses.MarginMSELoss(model)

# Tune the model
model.fit(train_objectives=[(train_dataloader, train_loss)], epochs=500, warmup_steps=int(len(train_dataloader) * 0.1))

# Replace 'output_path' with the desired path for saving the trained model
output_path = 'your_output_path'

# Save the model after training
model.save(output_path)

Consider taking a look at the Accelerate library; you can train on multiple GPUs with only a few changes to your code. A rough sketch of what that could look like follows below.
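
The sketch replaces model.fit (which runs the training loop on a single device) with a manual loop wrapped by Accelerate. It reuses the placeholders from the snippet above (model_name, max_seq_length, train_examples, output_path) and assumes sentence-transformers 2.x, where SentenceTransformer exposes the smart_batching_collate collate function; treat it as a starting point rather than a drop-in replacement. Launch it with `accelerate launch train.py` so both GPUs are used.

import torch
from accelerate import Accelerator
from sentence_transformers import SentenceTransformer, losses
from torch.utils.data import DataLoader

accelerator = Accelerator()

model = SentenceTransformer(model_name)
model.max_seq_length = max_seq_length

train_dataloader = DataLoader(
    train_examples,
    batch_size=16,
    shuffle=True,
    drop_last=True,
    collate_fn=model.smart_batching_collate,  # tokenizes InputExamples into tensors
)
train_loss = losses.MarginMSELoss(model)
optimizer = torch.optim.AdamW(train_loss.parameters(), lr=2e-5)

# prepare() wraps the loss module (which holds the model) in DistributedDataParallel,
# shards the dataloader across processes, and moves each batch to the right device
train_loss, optimizer, train_dataloader = accelerator.prepare(train_loss, optimizer, train_dataloader)

num_epochs = 10  # adjust as needed
train_loss.train()
for epoch in range(num_epochs):
    for features, labels in train_dataloader:
        loss = train_loss(features, labels)
        accelerator.backward(loss)
        optimizer.step()
        optimizer.zero_grad()

# Save only once, from the main process
accelerator.wait_for_everyone()
if accelerator.is_main_process:
    model.save(output_path)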

Will check this out.