Hi @meggers.
I was like you: I knew how to use the old method (transformers/convert_graph_to_onnx.py) but not the new one (transformers.onnx) to get the quantized ONNX version of a Hugging Face task model (for example, a Question-Answering model).
To illustrate it, I published this notebook in Colab: ONNX Runtime with transformers.onnx for HF tasks models (for example: QA model) (not only with transformers/convert_graph_to_onnx.py).
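For reference, here is a minimal sketch of the new workflow as I understand it (assuming a recent transformers version with the transformers.onnx module and the question-answering export feature, plus onnxruntime installed; the checkpoint, question, and context below are just examples, not anything prescribed by the docs):

```python
# 1) Export with the new transformers.onnx module (run in a shell):
#
#    python -m transformers.onnx --model=distilbert-base-uncased-distilled-squad \
#        --feature=question-answering onnx/
#
# distilbert-base-uncased-distilled-squad is an example QA checkpoint;
# any QA model name from the Hub should work the same way.

import numpy as np
from onnxruntime import InferenceSession
from onnxruntime.quantization import QuantType, quantize_dynamic
from transformers import AutoTokenizer

# 2) Dynamic quantization of the exported graph (weights -> int8)
quantize_dynamic(
    "onnx/model.onnx",            # produced by the export step above
    "onnx/model-quantized.onnx",  # output path (example name)
    weight_type=QuantType.QInt8,
)

# 3) Inference with ONNX Runtime on the quantized model
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased-distilled-squad")
session = InferenceSession("onnx/model-quantized.onnx")

# Example question/context pair (made up for illustration)
question = "What does Hugging Face provide?"
context = "Hugging Face provides open-source NLP models and libraries."
inputs = tokenizer(question, context, return_tensors="np")

# The QA export returns start and end logits over the tokens
start_logits, end_logits = session.run(None, dict(inputs))
start = int(np.argmax(start_logits))
end = int(np.argmax(end_logits))
print(tokenizer.decode(inputs["input_ids"][0][start : end + 1]))
```

The notebook linked above goes through the same steps in more detail (and compares latency with the non-quantized model).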
Hope that @lysandre @mfuntowicz @valhalla @lewtun will have some time to complete the online documentation Exporting transformers models and/or to update Microsoft's tutorials about ONNX.
Other topics about this subject:
- Inference with Finetuned BERT Model converted to ONNX does not output probabilities
- Gpt2 inference with onnx and quantize
- Got ONNXRuntimeError when try to run BART in ONNX format #12851
There is also the Accelerate Hugging Face models page from Microsoft, but the notebooks look very complicated (heavy code).