I modified BertEmbeddings, BertModel, and BertForTokenClassification to accept an additional feature (whether a token is capitalized or not). In pure transformers it all works, but I am struggling with implementing the export of this custom model (so I can optimize it with optimum and get an inference speed-up).
```python
from pathlib import Path
from typing import Dict

from optimum.exporters import TasksManager
from optimum.exporters.onnx.config import TextEncoderOnnxConfig
from optimum.utils import NormalizedTextConfig

register_for_onnx = TasksManager.create_register("onnx")

@register_for_onnx("custom-bert", "token-classification")
class CustomOnnxConfig(TextEncoderOnnxConfig):
    # Specifies how to normalize the BertConfig, this is needed to access common attributes
    # during dummy input generation.
    NORMALIZED_CONFIG_CLASS = NormalizedTextConfig

    # Sets the absolute tolerance used when validating the exported ONNX model against the
    # reference model.
    ATOL_FOR_VALIDATION = 1e-4

    @property
    def inputs(self) -> Dict[str, Dict[int, str]]:
        if self.task == "multiple-choice":
            dynamic_axis = {0: "batch_size", 1: "num_choices", 2: "sequence_length"}
        else:
            dynamic_axis = {0: "batch_size", 1: "sequence_length"}
        return {
            "input_ids": dynamic_axis,
            "attention_mask": dynamic_axis,
            "capitalization_ids": dynamic_axis,
            "token_type_ids": dynamic_axis,
        }
```
```python
base_model = CustomBertForTokenClassification.from_pretrained("my-checkpoint")
onnx_path = Path("model.onnx")
```
Here I do not understand what to do next: base_model.config returns a BertConfig, which I think I need to overwrite with the custom config I created in the previous step.
First, I can see that your new model will have a new input. We need a DummyInputGenerator that can handle this. So you could try something like:
```python
from optimum.utils import DummyTextInputGenerator

class MyDummyTextInputGenerator(DummyTextInputGenerator):
    SUPPORTED_INPUT_NAMES = (
        "input_ids",
        "attention_mask",
        "token_type_ids",
        "capitalization_ids",
    )
```
```python
class CustomOnnxConfig(TextEncoderOnnxConfig):
    # Specifies how to normalize the BertConfig, this is needed to access common attributes
    # during dummy input generation.
    NORMALIZED_CONFIG_CLASS = NormalizedTextConfig

    DUMMY_INPUT_GENERATOR_CLASSES = (MyDummyTextInputGenerator,)

    # Sets the absolute tolerance used when validating the exported ONNX model against the
    # reference model.
    ATOL_FOR_VALIDATION = 1e-4

    @property
    def inputs(self) -> Dict[str, Dict[int, str]]:
        if self.task == "multiple-choice":
            dynamic_axis = {0: "batch_size", 1: "num_choices", 2: "sequence_length"}
        else:
            dynamic_axis = {0: "batch_size", 1: "sequence_length"}
        return {
            "input_ids": dynamic_axis,
            "attention_mask": dynamic_axis,
            "capitalization_ids": dynamic_axis,
            "token_type_ids": dynamic_axis,
        }
```
Since an OnnxConfig already exists for bert, the register method will not do anything. If you want to be able to overwrite existing registered configurations, you can do that:
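For example, something like this (assuming your optimum version's TasksManager.create_register supports the overwrite_existing flag):

```python
# Assumption: create_register accepts overwrite_existing in your optimum version.
register_for_onnx = TasksManager.create_register("onnx", overwrite_existing=True)

@register_for_onnx("bert", "token-classification")
class CustomOnnxConfig(TextEncoderOnnxConfig):
    ...
```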
Hi Maiia,
I saw your message here. Thanks for creating a notebook, it really helped me to try things out.
You do not need to register anything, since you are not using the TasksManager in the end. Creating your OnnxConfig and using the export function is enough.
But because you are not using the TasksManager, you have to instantiate the OnnxConfig manually, and you need to specify the name of the task when doing so; otherwise it will infer the task to be the default one (BertModel and not BertForTokenClassification).
So to be able to export your model, the fix is actually easy:
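Something along these lines (a sketch; the exact export signature can vary between optimum versions, so I am using keyword arguments):

```python
from optimum.exporters.onnx import export

# Pass the task explicitly so the config does not fall back to the default task.
onnx_config = CustomOnnxConfig(base_model.config, task="token-classification")

onnx_inputs, onnx_outputs = export(
    model=base_model,
    config=onnx_config,
    output=onnx_path,
    opset=onnx_config.DEFAULT_ONNX_OPSET,
)
```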
Last question: if I need to reload it (as ORTModelForTokenClassification, to optimize it with ORTOptimizer), do I need to save a modified config? If I save the original config and do

```python
import torch
from optimum.onnxruntime import ORTModelForTokenClassification

reloaded_model = ORTModelForTokenClassification.from_pretrained("onnx")
inputs = {
    k: torch.zeros([2, 16], dtype=torch.long)
    for k in ["input_ids", "attention_mask", "capitalization_ids", "token_type_ids"]
}
reloaded_model(**inputs)
```

I get an error: ValueError: Model requires 4 inputs. Input Feed contains 3
Or maybe I just need to modify the ORTModelForTokenClassification class and that will fix the issue…
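For example, a rough sketch of what I have in mind (untested; it assumes the ORTModel wrapper exposes the underlying onnxruntime InferenceSession as self.model, as in current optimum versions):

```python
import torch
from optimum.onnxruntime import ORTModelForTokenClassification
from transformers.modeling_outputs import TokenClassifierOutput

# Hypothetical subclass that forwards the extra input to the ONNX session.
class CustomORTModelForTokenClassification(ORTModelForTokenClassification):
    def forward(self, input_ids, attention_mask, token_type_ids=None, capitalization_ids=None, **kwargs):
        # Feed all four inputs to the session instead of the default three.
        onnx_inputs = {
            "input_ids": input_ids.cpu().numpy(),
            "attention_mask": attention_mask.cpu().numpy(),
            "token_type_ids": token_type_ids.cpu().numpy(),
            "capitalization_ids": capitalization_ids.cpu().numpy(),
        }
        # self.model is assumed to be the onnxruntime.InferenceSession held by ORTModel.
        logits = self.model.run(None, onnx_inputs)[0]
        return TokenClassifierOutput(logits=torch.from_numpy(logits))
```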