Hi,
I’m running a simple pipeline on Google Colab, but GPU usage stays at 0% while performing inference on a large number of text inputs (according to the Colab resource monitor).
Here’s what I’ve tried (imports included for completeness):

import torch
from transformers import pipeline

model = pipeline("feature-extraction", device=torch.device("cuda"))
# and, alternatively:
model = pipeline("feature-extraction", device=0)
I can confirm that I’m on a T4 GPU runtime. Neither variant appears to use the GPU.
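For reference, here’s a minimal sanity-check snippet I’d expect to run cleanly on a T4 runtime before blaming the pipeline (just a sketch; the device index 0 is the default single-GPU case):

```python
import torch

# Confirm PyTorch can see the GPU at all.
print(torch.cuda.is_available())   # should print True on a T4 runtime
print(torch.version.cuda)          # CUDA version PyTorch was built against

if torch.cuda.is_available():
    # Name of the first visible GPU, e.g. "Tesla T4" on Colab.
    print(torch.cuda.get_device_name(0))
```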
Transformers version: 4.37.2
CUDA version (output of nvcc --version):
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corporation
Built on Tue_Aug_15_22:02:13_PDT_2023
Cuda compilation tools, release 12.2, V12.2.140
Build cuda_12.2.r12.2/compiler.33191640_0