Hello! I set up an inference endpoint for a private model to transcribe mp3 files with diarization. It worked with 15-minute files when called from Python using requests. But today it no longer works for the same files and returns a 504 error. With a smaller file (2 minutes long) the request returns the correct answer. When I look at the endpoint logs, it seems the whole pipeline runs to completion. Do you have any idea how to fix that?
Hi @jccscop, Thanks for reporting - we’ve just applied a fix and this should now be working as expected. Please let us know if you continue to see an error. Thanks again!
Hello! Thanks for your help! It now works for the 15-minute file, but it seems to fail for files 40 minutes long. When I run the code with python requests, the client hangs forever, even though the endpoint logs show the pipeline ran to the end and completed the POST request. The runs themselves are not very long: usually about 2 minutes for the 15-minute audio file and 4-5 minutes for the 40-minute files.
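As a workaround while waiting on a fix, one option is to set an explicit client-side timeout so the call fails fast instead of hanging forever when the response is dropped. A minimal sketch, assuming a placeholder endpoint URL and token (these names are illustrative, not from the thread):

```python
# Sketch: call an Inference Endpoint with an explicit timeout.
# ENDPOINT_URL and the token are hypothetical placeholders.
import requests

ENDPOINT_URL = "https://YOUR-ENDPOINT.endpoints.huggingface.cloud"


def build_headers(token: str) -> dict:
    """Bearer-auth headers for a raw mp3 upload."""
    return {"Authorization": f"Bearer {token}", "Content-Type": "audio/mpeg"}


def transcribe(path: str, token: str, read_timeout: float = 600.0) -> dict:
    # timeout=(connect, read): requests raises ReadTimeout instead of
    # hanging indefinitely if the server never returns the response.
    with open(path, "rb") as f:
        resp = requests.post(
            ENDPOINT_URL,
            headers=build_headers(token),
            data=f,
            timeout=(10.0, read_timeout),
        )
    resp.raise_for_status()  # surfaces a 504 as an HTTPError
    return resp.json()
```

With `timeout` set, a dropped response raises `requests.exceptions.ReadTimeout` after `read_timeout` seconds, which at least makes the failure visible and retryable rather than an indefinite hang.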
Hi @jccscop, Thanks for reporting. We’ve applied a fix, though please let us know if you continue to see an issue. Thanks!