Convert BERT tokenizer to ONNX

I was referring to the following blog to convert a BERT model to ONNX.

Here, to run inference on the exported model, I have to pass the 2D arrays produced by the BERT tokenizer.

Is there a way I can pass a sentence as input to the ONNX model and get the encodings as output, so that I can use the model in a platform-independent way?
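
For reference, this is roughly the workflow I have now (a minimal sketch; "bert-base-uncased" is just a placeholder for the checkpoint used in the blog):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoded = tokenizer("An example sentence", return_tensors="np")

# the ONNX session only accepts these 2D numpy arrays, not the raw string
print(encoded["input_ids"].shape)       # (1, sequence_length)
print(encoded["attention_mask"].shape)  # (1, sequence_length)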

hi @hasak,

the tokenizer is independent of onnx / onnxruntime, so you could create a simple function that converts your string inputs into the numpy format that the onnxruntime session expects:

from transformers import AutoTokenizer

# load whichever checkpoint you exported to ONNX, e.g. a BERT one
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

def prepare_for_session(text: str):
    # tokenize to PyTorch tensors, then convert them to the numpy arrays
    # that onnxruntime's InferenceSession.run expects
    tokens = tokenizer(text, return_tensors="pt")
    return {k: v.cpu().detach().numpy() for k, v in tokens.items()}
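
and then feed the result straight into your session, something like this (a sketch; "model.onnx" stands for whatever path your export produced):

import onnxruntime as ort

session = ort.InferenceSession("model.onnx")
# run() matches the dict keys against the graph's input names
# (input_ids, attention_mask, token_type_ids for a standard BERT export)
outputs = session.run(None, prepare_for_session("The tokenizer stays outside the graph."))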

does this answer your question?

I think using a separate tokenizer function does work, but bundling the tokenizer into the ONNX object has its own benefits. For example, it would let someone take this model and deploy it client-side in the browser using ONNX.js.

There is currently preliminary support for Node bindings for the Tokenizers library, but that doesn't support the latest LTS versions of Node and won't without a big overhaul of its "neon" dependency.

Interesting suggestion @aitordiaz. The Node bindings you are mentioning are the ones at tokenizers/README.md at main · huggingface/tokenizers · GitHub?

I agree that bundling the tokenizer in a portable format could be great, not only for client-side in-browser inference but for other use cases as well (if we don't want to use Python, for example). It seems like a huge amount of work though.
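
One partial route that exists today (not full bundling into the ONNX graph, but it is portable): fast tokenizers can already be serialized to a single tokenizer.json file, which the Rust and Node bindings of the Tokenizers library can load without Python. A rough sketch, assuming a fast tokenizer and a made-up "exported" directory:

from tokenizers import Tokenizer
from transformers import AutoTokenizer

# saving a fast tokenizer writes tokenizer.json alongside the other files
AutoTokenizer.from_pretrained("bert-base-uncased").save_pretrained("exported")

# that single file can then be reloaded by any binding of the tokenizers library
tok = Tokenizer.from_file("exported/tokenizer.json")
print(tok.encode("An example sentence").ids)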