How to get Accelerated Inference API for T5 models?

Hi @pierreguillou ,

Do you have a customer plan ? Optimizations are not on for non customers, leading to you not seeing the proper header.

Also keep in mind as mentioned in the docs, that for customers we’re usually able to go beyond the default depending on the load and requirements.

Cheers.