How did you create AWS API Gateway w/o 30s timeout?

Muennighoff · April 5, 2021, 5:36am

Is Hugging Face using AWS Lambda & API Gateway under the hood for its Hub APIs? If yes, how do you handle cold starts that take longer than 30s?

To the best of my knowledge, API Gateway enforces a strict timeout of 30s, yet some models on the Hub sometimes take more than 30s to respond when queried. How did you do that?
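
For context, here is a rough sketch of how the ">30s" observation could be checked from the client side. It is only an illustration: the endpoint URL pattern, the `query_model` and `timed_call` helpers, and the 120s client timeout are my own assumptions, not anything confirmed about how the Hub is built.

```python
import time
import urllib.request

# The ~30 s API Gateway limit mentioned in the question (an assumption here).
GATEWAY_LIMIT_S = 30.0


def timed_call(fn, *args, **kwargs):
    """Run fn(*args, **kwargs) and return (result, elapsed_seconds)."""
    start = time.monotonic()
    result = fn(*args, **kwargs)
    return result, time.monotonic() - start


def exceeds_limit(elapsed_s, limit_s=GATEWAY_LIMIT_S):
    """True if a request took longer than the assumed gateway limit."""
    return elapsed_s > limit_s


def query_model(url, payload, token, timeout_s=120.0):
    """POST a JSON payload (bytes) to a hosted model (hypothetical helper).

    timeout_s is the client-side socket timeout; it is set well above
    30 s so that the client is not the side that gives up first.
    """
    request = urllib.request.Request(
        url,
        data=payload,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return response.status, response.read()


# Example usage (not executed here; needs network access and a valid token):
#   url = "https://api-inference.huggingface.co/models/<model_id>"
#   (status, body), elapsed = timed_call(
#       query_model, url, b'{"inputs": "Hello"}', "<token>")
#   print(status, round(elapsed, 1), exceeds_limit(elapsed))
```

If a request comes back successfully after `exceeds_limit` reports `True`, the response cannot have passed through a gateway that cuts requests off at 30s, which is what prompted the question.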
