Greetings, everyone!
I have fine-tuned a model on a custom dataset and am now trying to deploy it to Amazon SageMaker using this code:
from sagemaker.huggingface import HuggingFaceModel
import sagemaker

role = sagemaker.get_execution_role()

# create Hugging Face Model Class
huggingface_model = HuggingFaceModel(
    model_data="s3://stnewssentiment/model.tar.gz",  # path to your trained SageMaker model
    role=role,                                       # IAM role with permissions to create an endpoint
    transformers_version="4.17",                     # Transformers version used
    pytorch_version="1.10",                          # PyTorch version used
    py_version="py38",                               # Python version of the DLC
    env={"HF_TASK": "text-classification"},
)

predictor = huggingface_model.deploy(
    initial_instance_count=1,
    instance_type="ml.c5.large",
)
But when I try to predict with predictor.predict(data), it gives me the error below.
ModelError: An error occurred (ModelError) when calling the InvokeEndpoint operation: Received client error (400) from primary with message "{
  "code": 400,
  "type": "InternalServerException",
  "message": "Can't load config for '/.sagemaker/mms/models/model'. If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure '/.sagemaker/mms/models/model' is the correct path to a directory containing a config.json file"
}". See https://us-east-2.console.aws.amazon.com/cloudwatch/home?region=us-east-2#logEventViewer:group=/aws/sagemaker/Endpoints/huggingface-pytorch-inference-2022-10-11-11-59-52-076 in account 430206693130 for more information.
I have tried the fixes this community has already discussed, but with no luck.
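For context on what I have checked so far: this error usually indicates that config.json is not at the root of the model.tar.gz archive (SageMaker extracts the archive directly into /.sagemaker/mms/models/model/, so files nested in a subfolder like model/ or checkpoint-500/ are not found). A quick local sanity check, assuming the archive has been downloaded from S3 first (e.g. with aws s3 cp), could look like this (has_root_config is just an illustrative helper name):

```python
import tarfile


def has_root_config(tar_path):
    """Return True if config.json sits at the top level of the archive.

    SageMaker extracts model.tar.gz into /.sagemaker/mms/models/model/,
    so config.json (and the model weights) must be at the archive root,
    not inside a subdirectory.
    """
    with tarfile.open(tar_path, "r:gz") as tar:
        # Member names may appear as "config.json" or "./config.json"
        return any(m.name.lstrip("./") == "config.json" for m in tar.getmembers())
```

If this returns False, repackaging the archive so that config.json, the tokenizer files, and the weights sit at the top level (tar -czf model.tar.gz -C <model_dir> .) should be the fix.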