Inference API model timeout (Flan-UL2)

I’ve been trying to access the Inference API for the model google/flan-ul2 for the past two days, but I keep getting a 504 (gateway time-out) error. The model never loads (inference via the web UI doesn’t work either), and the problem persists even when I authenticate with my (Pro) access token. Any suggestions?
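For reference, this is roughly how I’m calling the API (a minimal sketch; the token value is a placeholder, and the endpoint URL is the standard hosted Inference API pattern):

```python
import json
import urllib.request

API_URL = "https://api-inference.huggingface.co/models/google/flan-ul2"
TOKEN = "hf_xxx"  # placeholder: substitute your own access token

def build_request(prompt: str) -> urllib.request.Request:
    # Standard Inference API call: JSON payload with an "inputs" field,
    # authenticated via a bearer-token header.
    payload = json.dumps({"inputs": prompt}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {TOKEN}",
            "Content-Type": "application/json",
        },
    )

req = build_request("Translate to German: Hello")
# urllib.request.urlopen(req, timeout=120)  # this is the call that times out with 504
```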

Not sure if it’s for the same reason, but I also couldn’t get Inference Endpoints running for flan-t5-xxl or flan-ul2 on GPU [large] instances, and the xlarge instance type wasn’t available to me.