I was wondering if Hugging Face provides any support for exporting the `generate` function from the transformers library to ONNX?
Mainly, I'm trying to create an ONNX model from a GPT-2-style transformer in order to speed up inference when generating replies in a conversation.
I see there's some support for exporting a single forward call to GPT2, but not the entire loop used in greedy decoding, beam search, nucleus sampling, etc.
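To make concrete what I mean by "the loop": even with the single forward pass exported to ONNX, the decoding loop would still have to run in Python around the exported graph, roughly like this. (A minimal sketch — `next_token_logits` is a hypothetical toy stand-in for one exported-model call, e.g. an onnxruntime `session.run`, not a real API.)

```python
def next_token_logits(token_ids):
    # Toy scorer standing in for one ONNX forward pass: favors the token
    # equal to (last token + 1) mod vocab size, with token 0 acting as
    # an end-of-sequence token once id 4 has been produced.
    vocab_size = 5
    last = token_ids[-1]
    target = 0 if last == 4 else (last + 1) % vocab_size
    return [1.0 if tok == target else 0.0 for tok in range(vocab_size)]

def greedy_decode(prompt_ids, eos_id=0, max_new_tokens=10):
    # This Python loop is the part NOT captured when only a single
    # forward call is exported: each step feeds the growing sequence
    # back into the model and appends the argmax token.
    ids = list(prompt_ids)
    for _ in range(max_new_tokens):
        logits = next_token_logits(ids)
        next_id = max(range(len(logits)), key=lambda t: logits[t])
        ids.append(next_id)
        if next_id == eos_id:
            break
    return ids

print(greedy_decode([1]))  # → [1, 2, 3, 4, 0]
```

Ideally this whole loop (plus beam search / sampling variants) could live inside the exported graph so inference never bounces back to Python between tokens.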