Is huggingface using AWS Lambda & API Gateway under the hood for its hub APIs? If yes, how did you manage to have cold starts >30s?
To the best of my knowledge API Gateway has a strict timeout limit after 30s, yet when querying some models on the hub they sometimes take >30s. How did you do that?