Language generation with torchscript model?

Has anyone figured out how to run inference (equivalent to the `.generate()` method) for seq-to-seq models on Elastic Inference?
I am trying to run inference for seq-to-seq models (such as BART and Pegasus) on Elastic Inference with EC2.
So far I have been able to follow the TorchScript example (1) and save the traced model, but I cannot figure out how to run generation with it.
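For context, here is the kind of workaround I have been considering: since `.generate()` itself cannot be traced, trace only a single forward step and run the decoding loop in plain Python around the traced module. This is a minimal sketch with a toy encoder-decoder standing in for BART/Pegasus (the model class, vocabulary size, and token IDs are placeholder assumptions, not the real models):

```python
# Sketch: greedy decoding around a traced (TorchScript) seq2seq model.
# ToySeq2Seq is a hypothetical stand-in for a BART/Pegasus-style model;
# the idea is to trace one forward step, then loop over it for generation.
import torch
import torch.nn as nn

class ToySeq2Seq(nn.Module):
    """One forward step: returns next-token logits for the decoder input."""
    def __init__(self, vocab_size=32, d_model=16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.encoder = nn.GRU(d_model, d_model, batch_first=True)
        self.decoder = nn.GRU(d_model, d_model, batch_first=True)
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, input_ids, decoder_input_ids):
        _, h = self.encoder(self.embed(input_ids))
        dec_out, _ = self.decoder(self.embed(decoder_input_ids), h)
        return self.lm_head(dec_out[:, -1, :])  # logits for the next token

torch.manual_seed(0)
model = ToySeq2Seq().eval()

# Trace one forward step with example inputs, as in the TorchScript guide.
example = (torch.randint(0, 32, (1, 5)), torch.randint(0, 32, (1, 1)))
traced = torch.jit.trace(model, example)

def greedy_generate(traced_model, input_ids, bos_id=0, eos_id=1, max_len=8):
    """Greedy decoding loop driven from Python, calling the traced step."""
    decoder_ids = torch.tensor([[bos_id]])
    for _ in range(max_len):
        logits = traced_model(input_ids, decoder_ids)
        next_id = logits.argmax(dim=-1, keepdim=True)
        decoder_ids = torch.cat([decoder_ids, next_id], dim=1)
        if next_id.item() == eos_id:
            break
    return decoder_ids

out = greedy_generate(traced, torch.randint(0, 32, (1, 5)))
print(out.shape)
```

The trade-off is that each step re-encodes and re-runs the full decoder prefix (no cached past key/values), and Python-side loops lose beam search and the other `.generate()` features, so this is only a starting point, not a drop-in replacement.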

(1) Export to TorchScript