Hello,
I’m trying to deploy a model on AWS SageMaker (mosaicml/mpt-7b-chat · Hugging Face).
I used the deployment code generated on the model’s page for a quick start; it is as follows:
from sagemaker.huggingface import HuggingFaceModel
import sagemaker

role = sagemaker.get_execution_role()

# Hub Model configuration. https://huggingface.co/models
hub = {
    'HF_MODEL_ID': 'mosaicml/mpt-7b-chat',
    'HF_TASK': 'text-generation'
}

# create Hugging Face Model Class
huggingface_model = HuggingFaceModel(
    transformers_version='4.17.0',
    pytorch_version='1.10.2',
    py_version='py38',
    env=hub,
    role=role,
)

# deploy model to SageMaker Inference
predictor = huggingface_model.deploy(
    initial_instance_count=1,  # number of instances
    instance_type='ml.m5.xlarge'  # ec2 instance type
)

predictor.predict({
    'inputs': "Can you please let us know more details about your "
})
The code crashes at the “predict” call with this error: Loading /.sagemaker/mms/models/mosaicml__mpt-7b-chat requires you to execute the configuration file in that repo on your local machine. Make sure you have read the code there to avoid malicious use, then set the option trust_remote_code=True to remove this error.
I had the same issue locally and fixed it by adding the trust_remote_code=True parameter to AutoModelForCausalLM.from_pretrained, but that parameter doesn’t seem to exist on the HuggingFaceModel constructor.
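For reference, this is roughly what my local fix looked like (a minimal sketch; the tokenizer usage and generate arguments are just for illustration, the actual fix is the trust_remote_code=True parameter):

from transformers import AutoModelForCausalLM, AutoTokenizer

# trust_remote_code=True allows the custom model code shipped in the
# mosaicml/mpt-7b-chat repo to be executed when loading the model
model = AutoModelForCausalLM.from_pretrained(
    'mosaicml/mpt-7b-chat',
    trust_remote_code=True
)
tokenizer = AutoTokenizer.from_pretrained('mosaicml/mpt-7b-chat')

inputs = tokenizer("Can you please let us know more details about your ", return_tensors='pt')
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))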
Is there any way to fix this? Thank you!