Local LLM and ML platform with RTX 5090 GPU

I built a local AI workstation around an RTX 5090 (32 GB) for an uninterrupted, offline coding workflow.

OS: Debian 12 with a pinned NVIDIA .run driver (frozen for kernel stability).
LLMs: each in its own Python venv to keep the global stack clean (see the sketch after this list).
Tools in a default “example-venv”: PyTorch, SciPy, NumPy, pandas, Matplotlib, scikit-learn.
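Roughly, the per-model isolation looks like this; the `/opt/llm-envs` prefix and model names are illustrative, not my exact layout:

```python
import subprocess
import venv

# One venv per model, so nothing leaks into the global site-packages.
MODELS = ["deepseek-coder-v2-lite", "qwen2.5-coder"]

for name in MODELS:
    env_dir = f"/opt/llm-envs/{name}"
    venv.EnvBuilder(with_pip=True).create(env_dir)
    # Install each model's inference stack only inside its own venv.
    subprocess.run(
        [f"{env_dir}/bin/pip", "install", "llama-cpp-python"],
        check=True,
    )
```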

Short demo + full setup notes:
https://localprompt.ai/demo.mp4
https://localprompt.ai

Current favorite: DeepSeek-Coder-V2-Lite-Instruct (GGUF, Q8_0) for offline code help; I run it locally and use the venv to execute and validate the generated code.
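Here's a minimal sketch of that generate-then-validate loop, assuming llama-cpp-python; the model path, prompt, and venv interpreter path are illustrative, not my exact setup:

```python
import subprocess
import tempfile

from llama_cpp import Llama

# Illustrative GGUF path; point this at wherever the model lives.
llm = Llama(
    model_path="models/DeepSeek-Coder-V2-Lite-Instruct-Q8_0.gguf",
    n_gpu_layers=-1,  # offload all layers to the GPU
    n_ctx=8192,
)

resp = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Write a Python function that flattens a nested list."}],
)
snippet = resp["choices"][0]["message"]["content"]
# (A real loop would strip markdown fences from the reply first.)

# Validate the generated code with the interpreter from the isolated venv.
with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
    f.write(snippet)
subprocess.run(["/opt/llm-envs/example-venv/bin/python", f.name], check=True)
```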

I’d love feedback on two points:

  1. With a 32 GB GPU, which models are you finding best in practice as a coding assistant?
  2. For longer tasks, do you prefer a slightly smaller model with a bigger context window, or a stronger model, accepting that some of the chat history may fall out of context?

If you’re looking for a coding-specialized model that can be quantized to 32 GB or less (preferably 16 GB or less, to leave memory for context), the Qwen Coder series would be a safe bet. Devstral and NextCoder also seem promising.
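To make the context-memory point concrete, here's a rough back-of-envelope check; the layer/head counts are illustrative, not any specific model's:

```python
# Rough VRAM budget: quantized weights + KV cache must both fit in 32 GB.
def kv_cache_gib(n_layers, n_kv_heads, head_dim, ctx_len, bytes_per_elt=2):
    # K and V each hold n_layers * n_kv_heads * head_dim values per token (fp16).
    return 2 * n_layers * n_kv_heads * head_dim * ctx_len * bytes_per_elt / 2**30

weights_gib = 16  # e.g. a ~15B-param model at Q8_0, roughly 1 byte/param
ctx = 32_768
cache = kv_cache_gib(n_layers=48, n_kv_heads=8, head_dim=128, ctx_len=ctx)
print(f"weights ~{weights_gib} GiB + KV cache ~{cache:.1f} GiB at {ctx} tokens")
# -> weights ~16 GiB + KV cache ~6.0 GiB, leaving headroom on a 32 GB card
```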


Thanks a lot, I’ll try them sometime next week and post my findings here.
