How to run Hunyuan-A13B on a RTX 5090 / Blackwell?

Hi folks!

Since the launch of Hunyuan-A13B, I’ve been struggling to get it running on an RTX 5090 with 32 GB of RAM. The official Docker images from Tencent don’t seem to be compatible with the Blackwell architecture. I even tried building vLLM from source via git clone, but no luck either.

Any hints?

1 Like

I often use a stable version of PyTorch, and stable versions are often specified in Dockerfile. However, PyTorch 2.7 or later is required for Blackwell-generation GPUs. If there are no bugs in the model or libraries, installing PyTorch 2.7 or later with pip install in the Dockerfile should make it work.