Hello there, @meganariley ma’am. Many of the issues that I and many others are experiencing are with text-to-image models. The base SDXL models, as well as SD 3.5 and FLUX, are still working fine, but fine-tunes, such as those based on Illustrious or Pony for SDXL, no longer work with the HF Inference API.

The models I uploaded for use in my CPU-based Gradio Spaces were working fine in early April, but then this issue emerged. At first it gave an error along the lines of “model inference is not supported HF inference”, and now my Spaces show a “404: API not found” error. The most I can say about it is that most, if not all, fine-tuned text-to-image models worked before the rework of the HF Inference API some time last year. While some functionality was restored over the last few months, the API no longer works as intended for a majority of these models. I do not have the hardware needed to run them locally, nor do I have the budget to justify dedicated GPUs for my Gradio Spaces.

While I’m sure I can modify my code with the help of chatbots like Qwen 3.0 to use a third-party API like Replicate, I would love to see the original functionality of the HF Inference API restored, assuming this isn’t actually an error but rather the API being disabled to prevent general use and access by public Spaces while you all revamp it. I can’t really give you specific examples of models that aren’t working, as the number of affected models easily approaches 20,000, if not more. We would love a solution that doesn’t require a PRO membership to implement.