An interesting project. Just looking for advice

TechAnalytics916 · July 6, 2025, 9:55pm

I am working on an AI server build, honestly using AI (ChatGPT) to both guide me and teach me how to build an AI server. A friend of mine invested in a pretty extravagant system. (Specs below) and here’s where im kind of at with it. Ive been working on it for about 3 months now in my spare time and its a bit difficult but i did get Mistral-7B-Instruct-v0.3 loaded, ive loaded a few models and want to make the most of my hardware (this is all experimental for me) but i really want to push what i have to the absolute limitations. i have been trying to get Mistral-7B-Instruct-v0.3 to run but it just wont start for me. Any suggestions to make the most of my hardware. Specs as follow:

CPU: AMD EPYC Genoa 96C/192T (DDR5 ECC, ~3.7 GHz boost)
Motherboard: ASRock Rack GENOAD8X-2T/BCM (PCIe 5.0, BMC/IPMI)
Memory: 512 GB DDR5 ECC Registered (8 × 64 GB SK Hynix 4800 MT/s)
GPUs: NVIDIA A100-SXM4-40GB + NVIDIA Quadro RTX 8000
System Storage: 2 × Samsung 990 EVO 4TB NVMe (ZFS rpool, bpool)
Data Storage: 8 × Intel P4510 4TB NVMe on HighPoint SSD7580B RAID (ZFS aipool ~29 TB)

John6666 · July 7, 2025, 4:06am

I think the specs are more than enough…
First, try running the sample code for Colab on the page below.

If speed is important, I recommend using Ollama (for short sentences) or TGI or vLLM (for fast inference including long sentences). These will make better use of your hardware performance.

Topic		Replies	Views
Starting a project and wanting "in" on the community Beginners	0	176	February 25, 2024
What kind of hardware do you use? Models	0	190	February 19, 2024
Beginning with all of it Beginners	1	194	March 30, 2024
AI on budget (used) hardware Show and Tell	0	400	February 5, 2024
Demo of Open Domain Long Form Question Answering Beginners	13	4520	February 8, 2021

An interesting project. Just looking for advice

Related topics