It has 192GB of VRAM!
Ollama defaults to Q4_K_M quantization (that's what it uses if you don't specify otherwise, and it rarely causes problems), so I think it will be more than enough. The model itself takes a little over 64GB, plus a little more for inference. If many people use it at the same time, or if you process very long contexts, VRAM usage goes up, but you're still unlikely to run into problems.
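If you want to sanity-check the math, here's a rough back-of-envelope in Python. The parameter count and layer geometry below are hypothetical placeholders (the post doesn't say which model), and 4.85 bits/weight is just a typical effective rate for Q4_K_M:

```python
# Back-of-envelope VRAM estimate for a Q4_K_M model plus its KV cache.
# All model numbers here are hypothetical: ~105B parameters at ~4.85
# effective bits/weight (typical for Q4_K_M) lands near the
# "a little over 64GB" figure above.

def weights_gb(params_billion: float, bits_per_weight: float = 4.85) -> float:
    """Approximate memory for the quantized weights, in GB."""
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9

def kv_cache_gb(n_layers: int, n_kv_heads: int, head_dim: int,
                context_len: int, bytes_per_elem: int = 2) -> float:
    """Approximate fp16 KV-cache memory (K and V) for one sequence, in GB."""
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_elem / 1e9

print(f"weights        : {weights_gb(105):.1f} GB")                  # ~63.7 GB
# Hypothetical geometry: 88 layers, 8 KV heads (GQA), head_dim 128.
print(f"KV cache @ 32k : {kv_cache_gb(88, 8, 128, 32_768):.1f} GB")  # ~11.8 GB
```

Note that each concurrent user needs their own KV cache, which is why heavy parallel use or long contexts push VRAM up, but even so there's plenty of headroom in 192GB.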
By the way, Ollama is fast enough, but llama.cpp seems to be even faster. If speed becomes an issue, you might want to try switching.
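If you'd rather measure the difference on your own setup than take it on faith, here's a minimal sketch that asks each server for a completion and reports decode speed. It assumes both servers are already running on their default ports with the same model loaded; the model name is a placeholder, so substitute your own:

```python
# Compare decode speed between a local Ollama server and llama.cpp's
# llama-server. Assumes both are already running on default ports.
import requests

PROMPT = "Explain what Q4_K_M quantization is in one paragraph."

# Ollama: /api/generate (non-streaming) reports eval_count (tokens)
# and eval_duration (nanoseconds) in its response.
r = requests.post("http://localhost:11434/api/generate",
                  json={"model": "your-model", "prompt": PROMPT, "stream": False})
d = r.json()
print(f"ollama    : {d['eval_count'] / (d['eval_duration'] / 1e9):.1f} tok/s")

# llama.cpp: /completion returns a timings block with predicted_per_second.
r = requests.post("http://localhost:8080/completion",
                  json={"prompt": PROMPT, "n_predict": 256})
print(f"llama.cpp : {r.json()['timings']['predicted_per_second']:.1f} tok/s")
```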