Has anyone deployed a seq2seq model by converting it to ONNX?
What I want to do:
- Convert T5 (preferably, but any seq2seq model should work) to ONNX
- Serve the ONNX model with Triton Inference Server, using its ONNX Runtime or TensorRT backend
- Run the model on the GPU
- Do batch inference
Any help on this would be appreciated.
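In case it helps the discussion, this is roughly the Triton model config I'm picturing for the encoder half. Everything here is a placeholder guess, not a verified T5 export signature: the model name, the `input_ids`/`hidden_states` tensor names, and the hidden size of 512 (t5-small) are my assumptions.

```
# Hypothetical config.pbtxt for an ONNX T5 encoder on Triton.
# Tensor names and dims are assumptions, not the actual export signature.
name: "t5_encoder"
platform: "onnxruntime_onnx"
max_batch_size: 8
input [
  {
    name: "input_ids"        # assumed name from the ONNX export
    data_type: TYPE_INT64
    dims: [ -1 ]             # variable sequence length
  }
]
output [
  {
    name: "hidden_states"    # assumed name; 512 = t5-small hidden size
    data_type: TYPE_FP32
    dims: [ -1, 512 ]
  }
]
instance_group [ { kind: KIND_GPU } ]
dynamic_batching { max_queue_delay_microseconds: 100 }
```

My understanding is that `dynamic_batching` plus `max_batch_size` is what lets Triton merge concurrent requests into one batched GPU call, but I'd love confirmation from anyone who has done this with a seq2seq model, especially how you handled the decoder's autoregressive loop.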