Any idea why flash-attn installation with an AMD GPU results in metadata-generation-failed?

I’m trying to run my fine-tuned model on Setonix (a supercomputer with AMD MI250 GPUs), but I always end up with a "metadata generation failed" error when installing the flash-attn package.


In an environment without CUDA I do this:

import os
import subprocess

# Merge os.environ in, so pip is still on PATH; passing only the flag as
# env would wipe the rest of the environment. The variable tells
# flash-attn's setup.py to skip the CUDA build.
subprocess.run(
    'pip install flash-attn --no-build-isolation',
    env={**os.environ, 'FLASH_ATTENTION_SKIP_CUDA_BUILD': 'TRUE'},
    shell=True,
)
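If you don't need to drive the install from Python, the same thing can be done directly in a POSIX shell by setting the variable inline for the one command (a sketch of the equivalent invocation, not a guaranteed fix for the AMD build):

```shell
# FLASH_ATTENTION_SKIP_CUDA_BUILD tells flash-attn's setup.py to skip
# compiling the CUDA kernels during metadata generation / build.
FLASH_ATTENTION_SKIP_CUDA_BUILD=TRUE pip install flash-attn --no-build-isolation
```

The `VAR=value command` form sets the variable only for that single pip invocation, so it doesn't leak into the rest of your session.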