This is a continuation of my post here. I’m trying to deploy BERT for text classification with TensorFlow. When I use the model.deploy() method, I can successfully get inferences from BERT. Here’s my problem: I have four different classification models, and I want to run them all on the same instance rather than on multiple instances, to save on cost. So I tried using the MultiDataModel class, but I keep getting the following error:
The CloudWatch logs don’t add any additional information, unfortunately. Here’s the structure of counterargument.tar.gz in the S3 bucket, which I cloned from my Hugging Face account and zipped.
@wsunadawong, when using Multi-Model Endpoints, SageMaker stores the models differently. That’s why model.deploy() works while MME does not. We (Amazon & HF) are looking into it. Hopefully, we can come back with a fix as soon as possible!
Could you share the logs from the endpoint’s CloudWatch log group? Also, is counterargument.tar.gz a flat archive, or does the tar have a directory inside which these files live?
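For reference, a flat archive (files at the root of the tar, no wrapping folder) is usually produced by tar-ing from inside the model directory with `-C`. This is just a sketch; `model_dir` and the file names below are placeholders, not your actual artifacts:

```shell
# stand-in for the directory cloned from the Hugging Face hub
mkdir -p model_dir
touch model_dir/config.json model_dir/tf_model.h5 model_dir/vocab.txt

# -C changes into model_dir first, so entries land at the archive root
# (a flat layout), instead of being nested under model_dir/
tar -czf counterargument.tar.gz -C model_dir .

# list the contents to check: entries should appear as ./config.json,
# ./tf_model.h5, ... with no extra top-level folder
tar -tzf counterargument.tar.gz
```

If the listing instead shows something like `model_dir/config.json`, the archive is nested, which is a common cause of model-loading failures on SageMaker.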
The container seems to be using the default model directory to look up the model, instead of the MME platform’s model directory. We will try to reproduce this and get back to you.