Llama 70b model not using GPU

mac011769 · September 13, 2023, 6:18am

Hello
I am using Llama2-70b chat model. My PC has Nvidia T1000 GPU with i7-12700 CPU
When I run my llama model the GPU is not getting used. The output for a simple query like translate to French is taking about 30 mins. The utilization of CPU is 100% where as the GPU usage is 1%. Can somebody help me with this? I have installed CUDA already and have added the paths

Topic		Replies	Views
Hardware Requirement GPU Beginners	3	1126	January 27, 2025
GPU Optimisation Quantised Llama x Nvidia T4 Beginners	2	211	January 8, 2025
Find LLM to run on single gpu with only 8 GB ram Models	10	7641	March 22, 2024
How to get Llama-2-13b-chat-hf to ACTUALLY RUN Beginners	0	253	May 30, 2024
Llama 2 70B on a cpu Beginners	2	6864	August 23, 2023

Llama 70b model not using GPU

Related topics