I’m considering buying a graphics card, and besides gaming I want to use it for some local AI experiments, specifically the transformers and diffusers libraries with Hugging Face models. I currently have an AMD RX 5600 XT, which does not seem to be supported by ROCm. I’ve been looking mainly at AMD cards because of their compatibility with games and Linux.
If I went with another AMD card, like a 7900 XT 24GB or a 9070 16GB, what would that open up for me in terms of AI experiments? Which models would I be able to run, even if a bit slowly, and which models would still be off limits?
Thanks for any advice!
a 7900XT 24GB or a 9070 16GB
Both GPUs seem to have decent ROCm support. For generative AI tasks, compute speed matters, but insufficient VRAM prevents you from running a model at all. Since VRAM is generally the more critical constraint, the 24GB model would likely be easier to work with.
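As a rough back-of-the-envelope check (the figures are illustrative, and real usage adds overhead for activations and the KV cache), you can estimate the VRAM needed just to hold a model's weights from the parameter count and the bytes per weight:

```python
# Rough VRAM rule of thumb for inference: parameter count * bytes per weight.
# fp16 = 2 bytes/weight, 4-bit quantized ~ 0.5 bytes/weight.
# Real usage needs extra headroom for activations and the KV cache.
def weights_gb(params_billion: float, bytes_per_weight: float) -> float:
    return params_billion * 1e9 * bytes_per_weight / 1024**3

for params in (7, 13, 24):
    fp16 = round(weights_gb(params, 2.0), 1)
    q4 = round(weights_gb(params, 0.5), 1)
    print(f"{params}B model: ~{fp16} GB in fp16, ~{q4} GB at 4-bit")
```

By this estimate a 7B model fits comfortably in 16GB even at fp16, while models in the 13B–24B range start to require 24GB or quantization.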
When using a ROCm-compatible GPU on Linux, installing the ROCm build of PyTorch (basically, most Hugging Face libraries work if PyTorch works) lets you run things with minimal code changes.
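A minimal sketch of why no code changes are needed: ROCm builds of PyTorch (installed with something like `pip3 install torch --index-url https://download.pytorch.org/whl/rocm6.2`, where the exact ROCm version in the URL varies) expose AMD GPUs through the regular `torch.cuda` API, so the usual device-selection idiom works unchanged:

```python
import torch

# On a ROCm build of PyTorch, an AMD GPU shows up through the regular
# torch.cuda API, so code written for NVIDIA cards runs as-is.
device = "cuda" if torch.cuda.is_available() else "cpu"
print(f"Using device: {device}")

# The same pattern then carries over to transformers/diffusers,
# e.g. model.to(device) or pipeline(..., device=device).
x = torch.randn(4, 4, device=device)
print(x.device.type)
```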
For non-Hugging Face tools, using vLLM and Ollama for LLMs, or ComfyUI and A1111 WebUI for image generation, should also work without issues.
Fine-tuning LLMs should also be possible.
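On a single 24GB card, fine-tuning usually means a parameter-efficient method such as LoRA (in practice via the peft library). The core idea, sketched here in plain PyTorch as an illustration rather than the actual peft implementation, is to freeze the base weights and train only a small low-rank update:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Minimal LoRA sketch: frozen base layer plus a trainable low-rank delta."""
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # the base model stays frozen
        # Low-rank factors A (in_features x rank) and B (rank x out_features);
        # B starts at zero so training begins from the unmodified base model.
        self.lora_a = nn.Parameter(torch.randn(base.in_features, rank) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(rank, base.out_features))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + (x @ self.lora_a @ self.lora_b) * self.scale

layer = LoRALinear(nn.Linear(512, 512))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
total = sum(p.numel() for p in layer.parameters())
print(f"trainable: {trainable} / total: {total}")  # only ~3% of weights train
```

Because only the small adapter matrices need gradients and optimizer state, the VRAM cost of fine-tuning drops dramatically compared to full fine-tuning, which is what makes it feasible on a consumer card.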
However, I currently don’t own any AMD GPUs other than an APU, so please take this as general advice… Also, while it’s not much of an issue on Linux, Windows often has various incompatibilities; that’s true even with NVIDIA. Since you’re using Linux, you’ll probably be fine.