Hugging Face Forums
Speeding up T5 inference 🚀
🤗Transformers
kira, March 15, 2021, 9:29am (post #14)
Thank you, @valhalla. Created a new thread here.