I am building a server with 124 GB of VRAM from 5x Tesla K80 and 1 Quadro K5000, running off 2 Xeon E5-2690 v4 CPUs with 192 GB of RAM, with Proxmox installed. The use case is running Home Assistant for 2 to 3 homes, running an offline mobile LLM, and running a local coding assistant. Would this suffice for my use case? What models would you recommend?
It seems that even fairly small models can handle tasks like these. With those specifications, it should be possible to run multiple 32B models simultaneously… Is that overkill?
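Rough back-of-the-envelope numbers behind that claim (the quantization level and overhead allowance below are assumptions, not measurements):

```python
# Back-of-the-envelope VRAM budget; quantization and overhead figures are rough assumptions.
gpus_gb = [24] * 5 + [4]        # 5x Tesla K80 (24 GB each) + Quadro K5000 (4 GB)
total_vram_gb = sum(gpus_gb)    # 124 GB across all cards
# Note: each K80 presents as two 12 GB GPUs, so a ~20 GB model must be split across devices.

params_b = 32                   # 32B-parameter model
bytes_per_param = 0.5           # ~4-bit quantization (e.g. a Q4 GGUF)
kv_and_overhead_gb = 4          # rough allowance for KV cache + runtime buffers

per_model_gb = params_b * bytes_per_param + kv_and_overhead_gb    # ~20 GB
models_that_fit = int(total_vram_gb // per_model_gb)              # ~6 concurrent 32B models, on paper

print(f"Total VRAM: {total_vram_gb} GB")
print(f"Approx. footprint per 32B model (4-bit): {per_model_gb:.0f} GB")
print(f"32B models that fit on paper: {models_that_fit}")
```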
Larger models tend to be more versatile, so I think Instruct models (models tuned for chat) of 7B or larger should be suitable for most of these applications…
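As a concrete starting point, here is a minimal sketch of talking to a 7B instruct model with llama-cpp-python; the model path and parameters are placeholders you would swap for whatever GGUF you actually download and for your own GPU split:

```python
# Minimal chat sketch using llama-cpp-python; the model path and settings are placeholders.
from llama_cpp import Llama

llm = Llama(
    model_path="/models/your-7b-instruct.Q4_K_M.gguf",  # any 7B instruct GGUF on disk
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload all layers to GPU if VRAM allows
)

response = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a home automation assistant."},
        {"role": "user", "content": "Turn on the living room lights at sunset."},
    ],
    max_tokens=256,
)

print(response["choices"][0]["message"]["content"])
```

For the Home Assistant and coding-assistant pieces you could instead expose the same model over llama-cpp-python's OpenAI-compatible server (`python -m llama_cpp.server`) so each VM just talks to it over HTTP.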