How to install flash-attention on an HF Gradio Space

I tried putting flash-attn in the requirements.txt file to install flash-attention on my Space, but it gives an error that torch is not installed.

I also tried listing torch above flash-attn, but it still fails, probably because torch is not yet installed when flash-attn's build step runs.

Please help!

hi @nxphi47,

One option is to use a custom Dockerfile and install flash-attn as a build step, for example:
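
A rough sketch of such a Dockerfile (the base image and the torch/CUDA versions here are assumptions; adjust them to your Space's hardware, and note that compiling flash-attn from source can take a long time):

# Hypothetical Dockerfile sketch: install torch first so flash-attn's build can find it
FROM nvidia/cuda:11.8.0-devel-ubuntu22.04
RUN apt-get update && apt-get install -y python3 python3-pip git
RUN pip3 install packaging ninja
RUN pip3 install torch --index-url https://download.pytorch.org/whl/cu118
RUN pip3 install flash-attn --no-build-isolation
WORKDIR /app
COPY . .
CMD ["python3", "app.py"]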

Another option, if you're using the Gradio/Streamlit SDK, is to install it at runtime from your app code:

import os, subprocess
# Merge os.environ in so PATH stays visible to pip; FLASH_ATTENTION_SKIP_CUDA_BUILD=TRUE skips compiling the CUDA kernels from source.
subprocess.run('pip install flash-attn --no-build-isolation', env={**os.environ, 'FLASH_ATTENTION_SKIP_CUDA_BUILD': "TRUE"}, shell=True)
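
Run this near the top of app.py, before importing anything that needs flash_attn, since the package has to be installed before the import happens.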

Put this prebuilt wheel inside your requirements.txt; because it is already compiled, pip doesn't need torch installed to build it:

https://github.com/Dao-AILab/flash-attention/releases/download/v2.5.9.post1/flash_attn-2.5.9.post1+cu118torch1.12cxx11abiFALSE-cp310-cp310-linux_x86_64.whl
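
In context, a requirements.txt using it would look roughly like this (a sketch; the +cu118, torch and cp310 tags in the filename must match the Space's CUDA, torch, and Python versions, so pick the release asset that fits your setup):

# the wheel tags must match the Space's CUDA, torch and Python versions
torch
https://github.com/Dao-AILab/flash-attention/releases/download/v2.5.9.post1/flash_attn-2.5.9.post1+cu118torch1.12cxx11abiFALSE-cp310-cp310-linux_x86_64.whl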
