How can we test Transformer Models after converting it to TFLite format

bhadresh-savani · October 21, 2020, 11:38am

I tried converting the Distilbert Model of Huggingface to TFLite format using this script Reference Script

I was able to convert it. After that, I wanted to test it in Python itself using tf.lite.Interpreter but I was not able to figure out the correct input dimension required for the model. Can anyone help me with this?

Colab Notebook

valhalla · October 21, 2020, 3:23pm

Pinging @jplu, as he’s working on TF

jplu · October 22, 2020, 12:52pm

Hello!

Unfortunately I’m not really familiar with TFLite sorry. As a first guess I would suggest to get your features directly from the Tokenizers and see if it works.

bhadresh-savani · October 23, 2020, 9:22am

Hello @jplu
I tested TFdistilbertModel but after conversion, it’s only expecting tensor of 3*5 dimension. The output of the tokenizer is a single-dimensional array.

I also try to set input like this

input_spec = tf.TensorSpec([1, 768], tf.int32)
model._set_inputs(input_spec, training=False)

But when I check model.inputs it was None and I was getting same error related to dimension.

jplu · October 23, 2020, 1:46pm

From what I see in your colab, your TFlite model is waiting only one input (input_ids) and you are giving two (input_ids + attention_mask). Also the method _set_inputs doesn’t work with TF >= 2.2. You have to use:

model._saved_model_inputs_spec = None
model._set_save_spec(input_spec)

When doing TFDistilBertModel.from_pretrained('distilbert-base-uncased') the model is built with a fix input, a batch of three sentences of 3 tokens (then 5 with start and end tokens). So if you want it to fit your needs when creating your savedmodel/TFlite model you have to modify this beforehand as I specified above.

haha0542 · October 26, 2020, 3:46pm

Hi @jplu

Is there a way to allow dynamic input size rather than setting a pre-fixed size? Thanks.

jplu · October 26, 2020, 4:56pm

Not for now.

bhadresh-savani · October 27, 2020, 8:19am

Thanks @jplu
By considering your suggestion by setting input shape, I was able to do inference on TFlite models converted from Huggingface.
Here is my Notebook for someone who is facing such an issue.

chaine09 · February 10, 2024, 4:23pm

@bhadresh-savani hello I could not access the notebook you provided. Could you send an updated link?

gilbertogalvis · March 26, 2024, 11:41pm

github.com

bhadreshpsavani/UnderstandingNLP/blob/master/Notebooks/TFLite/TFLiteExperiments.ipynb

{
  "nbformat": 4,
  "nbformat_minor": 0,
  "metadata": {
    "colab": {
      "name": "TFLiteExperiments.ipynb",
      "provenance": [],
      "collapsed_sections": [],
      "authorship_tag": "ABX9TyPA2eFqMnkEGqgxIQ3c0DXr",
      "include_colab_link": true
    },
    "kernelspec": {
      "name": "python3",
      "display_name": "Python 3"
    },
    "accelerator": "GPU",
    "widgets": {
      "application/vnd.jupyter.widget-state+json": {
        "78a86cd1b2334981854a64e4a7aa6e1e": {
          "model_module": "@jupyter-widgets/controls",

This file has been truncated. show original

Topic		Replies	Views
Import distilbert-base-uncased tokenizer to an android app along with the tflite model 🤗Tokenizers	3	1933	December 29, 2021
Convert transformer to SavedModel Beginners	4	2566	November 30, 2021
TF transformers model inputs and outputs showing none? 🤗Transformers	1	1139	April 25, 2022
SavedModel export for DistilBERT is failing 🤗Transformers	9	507	October 9, 2020
Upload a TF model to Huggingface Intermediate	6	1064	September 1, 2021

How can we test Transformer Models after converting it to TFLite format

Related topics