I’m currently working on deploying FLUX on a Hugging Face Inference Dedicated Endpoint, but I’ve run into a few challenges. I wanted to check if anyone here has had success deploying it.
If you’ve managed to do it, I’d love to hear about your setup, tips, and any resources you found helpful. Specifically, I’m looking for insights on model optimization, handling large parameter sizes, or any custom configurations you used.
Unfortunately, all I can offer is an example of the same failure, if you don’t mind.
I have never deployed to a Dedicated Endpoint myself, but the steps should be similar to deploying to the Serverless Inference API.
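For reference, this is roughly what calling such a model through the Serverless Inference API looks like from the client side. This is just a minimal sketch; the repo id is a placeholder, not a real model.

```python
# Minimal sketch: call the Serverless Inference API for text-to-image.
# "your-username/flux-diffusers" is a placeholder repo id.
from huggingface_hub import InferenceClient

client = InferenceClient(token="hf_...")  # your HF access token
image = client.text_to_image(
    "a cat wearing a space suit",
    model="your-username/flux-diffusers",
)
image.save("result.png")  # returns a PIL image
```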
As far as I know, both start with converting the safetensors file to Diffusers format and uploading it to the Hub.
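In case it helps, here is a rough sketch of that conversion step with diffusers, assuming a single-file FLUX checkpoint. The file name and repo id are placeholders, and depending on what the checkpoint actually contains you may need to supply the text encoders and VAE separately.

```python
# Rough sketch: convert a single-file FLUX checkpoint to Diffusers format
# and push the result to the Hub. File name and repo id are placeholders.
import torch
from diffusers import FluxPipeline

pipe = FluxPipeline.from_single_file(
    "flux1-dev-finetune.safetensors",  # local checkpoint (placeholder name)
    torch_dtype=torch.bfloat16,
)
pipe.save_pretrained("flux-diffusers")             # multi-folder Diffusers layout
pipe.push_to_hub("your-username/flux-diffusers")   # upload to the Hub
```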
I’ve gotten that far, but no matter what I try, inference doesn’t work… It would work if the model were SDXL or SD1.5, and it would also work if the model were used often enough to be cached on the HF servers, like the official FLUX.1 dev repo from BFL itself, but that isn’t the case here.
I think loading it from a Zero GPU Space works fine, but that isn’t really an answer for an Endpoint, and why it fails there still doesn’t make sense to me.
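One thing that might be worth trying on the Dedicated Endpoint side is a custom handler (a handler.py at the root of the model repo), so the Endpoint loads the pipeline explicitly instead of relying on the default text-to-image pipeline. A minimal sketch, assuming the repo is already in Diffusers format and the endpoint hardware has enough VRAM for FLUX in bf16:

```python
# handler.py — minimal sketch of a custom Inference Endpoints handler
# that loads a Diffusers-format FLUX repo explicitly.
import base64
from io import BytesIO

import torch
from diffusers import FluxPipeline


class EndpointHandler:
    def __init__(self, path: str = ""):
        # `path` is the local copy of the repo the Endpoint was created from.
        self.pipe = FluxPipeline.from_pretrained(path, torch_dtype=torch.bfloat16)
        self.pipe.to("cuda")

    def __call__(self, data: dict) -> dict:
        prompt = data.get("inputs", "")
        params = data.get("parameters", {}) or {}
        image = self.pipe(
            prompt,
            num_inference_steps=params.get("num_inference_steps", 28),
            guidance_scale=params.get("guidance_scale", 3.5),
        ).images[0]
        # Return the image as base64 so the response serializes cleanly as JSON.
        buffer = BytesIO()
        image.save(buffer, format="PNG")
        return {"image": base64.b64encode(buffer.getvalue()).decode("utf-8")}
```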