Custom DistilBERT does not use CUDA for prediction

Hi,

I am using the model classifier = pipeline("text-classification", model='bhadresh-savani/distilbert-base-uncased-emotion', return_all_scores=True) for emotion classification of my dataset of 1.2 million client feedback texts. I am not training the model, I am just doing prediction. I noticed that the model uses the CPU and does not use CUDA (I have an RTX 5000), and the prediction takes ages to compute.
Can you explain why this is the case? Is there a way to use CUDA for this model's predictions?

Thank you
Krzysztof

I found a solution: add the parameter device=0. But I have to classify in small batches, as GPU RAM is a serious limit, even though my GPU has 16 GB.
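
For reference, a minimal sketch of the original call with the device argument added (assuming a single-GPU machine where CUDA device index 0 is the RTX 5000):

from transformers import pipeline

# device=0 puts the model on the first GPU; the default (device=-1) keeps it on CPU.
classifier = pipeline(
    "text-classification",
    model="bhadresh-savani/distilbert-base-uncased-emotion",
    return_all_scores=True,
    device=0,
)

print(classifier("I love this product!"))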

Very interesting, I had the same issue. How do you change the batch size with a pipeline?

This is probably primitive, but it works. I did it in a loop. It took six hours to classify 1.2 million reviews, some quite long, on a GPU with 16 GB of memory.

from transformers import pipeline

classifier = pipeline("zero-shot-classification",
                      model="typeform/distilbert-base-uncased-mnli",
                      device=0)

emo_zs_labels = list()
ind = 0
lenss = len(sentences)
seq_len = 100  # batch size; larger values ran out of GPU memory (see below)

while ind < lenss:
    bb = ind
    ee = bb + seq_len if bb + seq_len < lenss else lenss
    pred = classifier(sentences[bb:ee],
                      candidate_labels=["sadness", "joy", "love", "anger", "fear", "surprise"])
    temp_labels = [x["labels"][0] for x in pred]  # top label for each review in the batch
    emo_zs_labels.append(temp_labels)             # one list of labels per batch
    ind = ee
    print(ind)

I experimented with seq_len; values of 1000 and 500 caused CUDA out of memory, so I finally set it to 100.

Got it, you created the batches yourself. I was curious whether the pipeline would handle this, but apparently it does not.
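
As an aside: in more recent transformers releases, the pipeline call itself accepts a batch_size argument when you pass it a list (or a Dataset), so the manual loop may not be strictly necessary there. A minimal sketch, assuming such a version and the same sentences list as above:

from transformers import pipeline

classifier = pipeline(
    "zero-shot-classification",
    model="typeform/distilbert-base-uncased-mnli",
    device=0,
)

# Let the pipeline batch the inputs internally; tune batch_size to fit GPU memory.
preds = classifier(
    sentences,
    candidate_labels=["sadness", "joy", "love", "anger", "fear", "surprise"],
    batch_size=100,
)
labels = [p["labels"][0] for p in preds]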

Just a question: what is the advantage of using a pipeline vs. using a tokenizer, loading the model, fine-tuning it, and finally classifying? Have you tried?

Yes. A pipeline takes just a few lines of code to train or predict with. But with the tokenizer etc. I feel I have more control over the hyperparameters.

Interesting, I thought a pipeline had few hyperparameters… how do you access/change them in your example?

That is what I am saying. If you set up the tokenizer, model, and trainer separately, you have better control of the parameters. And if you do it in native PyTorch, even more. But then, instead of 5 lines of code, you have a few hundred.
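
To illustrate the "more control" point, here is a rough sketch of the same prediction with the tokenizer and model loaded separately and run in PyTorch; the function name, batch_size, and max_length values are just illustrative assumptions, not something from this thread:

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_name = "bhadresh-savani/distilbert-base-uncased-emotion"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name).to("cuda").eval()

def predict(texts, batch_size=100, max_length=256):
    labels = []
    for i in range(0, len(texts), batch_size):
        batch = texts[i:i + batch_size]
        # Explicit control over padding, truncation, and sequence length.
        enc = tokenizer(batch, padding=True, truncation=True,
                        max_length=max_length, return_tensors="pt").to("cuda")
        with torch.no_grad():
            logits = model(**enc).logits
        ids = logits.argmax(dim=-1).tolist()
        labels.extend(model.config.id2label[idx] for idx in ids)
    return labels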