Convert transformer to SavedModel

aishutin · July 17, 2020, 9:26am

Hi! I found out that this is a common unresolved problem.

So, I need to convert the transformers DistilBERT model to TensorFlow's SavedModel format. I've converted it, but I can't run inference with it.

Conversion code

import tensorflow as tf
from transformers import TFAutoModel, AutoTokenizer
dir = "distilbert_savedmodel"

model = TFAutoModel.from_pretrained('distilbert-base-uncased')
model.save(dir)

Inference code

tokenizer = AutoTokenizer.from_pretrained('distilbert-base-uncased')
encoded = tokenizer.encode('Hello, world!', add_special_tokens=True, return_tensors="tf")
model = tf.keras.models.load_model(dir)
model(encoded)

Error


ValueError: Could not find matching function to call loaded from the SavedModel. Got:
  Positional arguments (1 total):
    * Tensor("inputs:0", shape=(1, 6), dtype=int32)
  Keyword arguments: {'training': False}

Expected these arguments to match one of the following 4 option(s):

Option 1:
  Positional arguments (1 total):
    * {'input_ids': TensorSpec(shape=(None, 5), dtype=tf.int32, name='input_ids')}
  Keyword arguments: {'training': False}

Option 2:
  Positional arguments (1 total):
    * {'input_ids': TensorSpec(shape=(None, 5), dtype=tf.int32, name='input_ids')}
  Keyword arguments: {'training': True}

Option 3:
  Positional arguments (1 total):
    * {'input_ids': TensorSpec(shape=(None, 5), dtype=tf.int32, name='inputs/input_ids')}
  Keyword arguments: {'training': True}

Option 4:
  Positional arguments (1 total):
    * {'input_ids': TensorSpec(shape=(None, 5), dtype=tf.int32, name='inputs/input_ids')}
  Keyword arguments: {'training': False}
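
Reading the error: the call passed a bare tensor of shape (1, 6), while every saved option expects a dict with an `input_ids` entry of shape (None, 5). The sketch below mimics that comparison in plain Python, without TensorFlow. The helpers `shape_matches` and `call_matches` are hypothetical names made up for illustration; they are not TensorFlow's actual matching code, only a rough model of the check the message describes.

```python
def shape_matches(actual, expected):
    """True if ranks agree and each dimension agrees; None accepts any size."""
    return len(actual) == len(expected) and all(
        e is None or a == e for a, e in zip(actual, expected)
    )


def call_matches(arg, spec):
    """Rough model of the structural check in the error message.

    `spec` maps input names to expected shapes. `arg` is either a bare
    shape tuple (a lone tensor) or a dict of name -> shape.
    """
    if not isinstance(arg, dict):
        return False  # a lone tensor cannot match a dict-shaped signature
    return arg.keys() == spec.keys() and all(
        shape_matches(arg[name], spec[name]) for name in spec
    )


# Shape taken from the "Expected these arguments" options above.
saved_spec = {"input_ids": (None, 5)}

bare_tensor = (1, 6)                    # what model(encoded) passed
dict_wrong_len = {"input_ids": (1, 6)}  # right structure, wrong length
dict_ok = {"input_ids": (1, 5)}         # right structure and length

for candidate in (bare_tensor, dict_wrong_len, dict_ok):
    print(candidate, call_matches(candidate, saved_spec))
```

Under this model only the last candidate matches, which suggests two separate mismatches in the original call: the input is not wrapped in a dict, and the sequence length is 6 instead of the traced 5.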

Related issues

huggingface/transformers#4004
huggingface/transformers#2135
huggingface/transformers#2021

Please, help me!

rgwatwormhill · November 18, 2020, 12:25pm

In PyTorch, you could save the model with something like

torch.save(model.state_dict(), '/content/drive/My Drive/ftmodelname')

Then you could create a model using the pre-trained weights

tuned_model = BertForSequenceClassification.from_pretrained('bert-base-uncased',
                                                            num_labels=NCLASSES,
                                                            output_attentions=True)

and then overwrite its weights from the saved state_dict with

tuned_model.load_state_dict(torch.load('/content/drive/My Drive/ftmodelname',
                                       map_location=torch.device("cpu")),
                            strict=False)

I expect you could do a similar save of the model's state_dict using TensorFlow.

alchencjt · September 8, 2021, 7:30am

How did you end up solving the problem? I came across the same one.

OverFlow7 · November 27, 2021, 10:06pm

I have the same problem.

I can train a model, evaluate it, and predict with it as long as everything happens within the same script. But if I save the model and then load it, I get the same kind of error. Is it currently impossible to load a saved model with tensorflow/keras?
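
Loading a SavedModel does work in general. The error in the first post appears to come from the traced signature, which accepts only a dict with an `input_ids` tensor of length 5. Below is a minimal sketch of exporting with an explicit, flexible input signature, using a stand-in `tf.Module` rather than DistilBERT itself. The `TinyEncoder` class and its sizes are hypothetical and chosen only for illustration; whether the same approach carries over unchanged to a transformers model is an assumption.

```python
import tempfile

import tensorflow as tf


class TinyEncoder(tf.Module):
    """Stand-in for a transformer: an embedding lookup over token ids."""

    def __init__(self, vocab_size=100, dim=8):
        super().__init__()
        self.embeddings = tf.Variable(
            tf.random.normal([vocab_size, dim]), name="embeddings"
        )

    # Both the batch size and the sequence length are left as None, so the
    # exported function is not tied to one traced input length.
    @tf.function(input_signature=[
        {"input_ids": tf.TensorSpec(shape=[None, None], dtype=tf.int32,
                                    name="input_ids")}
    ])
    def __call__(self, inputs):
        return tf.nn.embedding_lookup(self.embeddings, inputs["input_ids"])


export_dir = tempfile.mkdtemp()
tf.saved_model.save(TinyEncoder(), export_dir)

restored = tf.saved_model.load(export_dir)

# The restored function takes a dict, matching the structure it was saved with.
short = restored({"input_ids": tf.zeros([1, 5], dtype=tf.int32)})
longer = restored({"input_ids": tf.zeros([2, 6], dtype=tf.int32)})
print(short.shape, longer.shape)
```

The two points that matter are that the input is passed as a dict with the same key the signature declares, and that the declared shape leaves the sequence length open.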

Kforcode · November 30, 2021, 3:34pm

Hugging Face has a model.save_pretrained() method.
