I deployed tiiuae/falcon-7b-instruct on AWS SageMaker, but when I invoke the endpoint for a prediction it fails with the following error:
(ModelError) when calling the InvokeEndpoint operation: Received client error (400) from primary with message "{
  "code": 400,
  "type": "InternalServerException",
  "message": "Loading /.sagemaker/mms/models/tiiuae__falcon-7b-instruct requires you to execute the configuration file in that repo on your local machine. Make sure you have read the code there to avoid malicious use, then set the option trust_remote_code=True to remove this error."
}"
I deployed directly from the Hub. Should I use another deployment option?
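If I read the message right, the Falcon repo ships custom modeling code, and transformers refuses to load it unless trust_remote_code=True is passed. The stock Hugging Face inference container does not seem to set this. I saw that newer versions of the sagemaker-huggingface-inference-toolkit read an HF_TRUST_REMOTE_CODE environment variable (this is an assumption on my part; the toolkit bundled with the transformers 4.26 DLC may not support it). If it is supported, the fix might be as small as adding it to the hub config:

hub = {
    'HF_MODEL_ID': 'tiiuae/falcon-7b-instruct',
    'HF_TASK': 'text-generation',
    'HF_TRUST_REMOTE_CODE': 'true'  # assumption: only honored by newer toolkit versions
}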
My deployment code:
import sagemaker
from sagemaker.huggingface import HuggingFaceModel

role = sagemaker.get_execution_role()  # IAM role with SageMaker permissions

## Hub Model configuration. https://huggingface.co/models
hub = {
    'HF_MODEL_ID': 'tiiuae/falcon-7b-instruct',
    'HF_TASK': 'text-generation'
}

# create Hugging Face Model Class
huggingface_model = HuggingFaceModel(
    transformers_version='4.26.0',
    pytorch_version='1.13.1',
    py_version='py39',
    env=hub,
    role=role,
)
# deploy model to SageMaker Inference
predictor = huggingface_model.deploy(
    initial_instance_count=1,     # number of instances
    instance_type='ml.m5.xlarge'  # EC2 instance type
)
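Otherwise, the alternative I am considering is the dedicated Hugging Face LLM inference container (TGI), which as far as I can tell supports Falcon natively, so trust_remote_code should not be needed. A minimal sketch, with the caveat that the container version 0.8.2 and the ml.g5.2xlarge instance type are assumptions on my part (Falcon-7B is also probably too large for the ml.m5.xlarge I used above, since the fp32 weights alone are around 28 GB):

import sagemaker
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

role = sagemaker.get_execution_role()

# retrieve the Hugging Face LLM (TGI) container image URI
llm_image = get_huggingface_llm_image_uri("huggingface", version="0.8.2")

llm_model = HuggingFaceModel(
    role=role,
    image_uri=llm_image,
    env={
        'HF_MODEL_ID': 'tiiuae/falcon-7b-instruct',
        'SM_NUM_GPUS': '1'  # GPUs available on the instance
    }
)

llm = llm_model.deploy(
    initial_instance_count=1,
    instance_type='ml.g5.2xlarge'  # 1x A10G (24 GB), enough for 7B in fp16
)

Would this be the recommended route, or is there a way to make the plain HuggingFaceModel deployment above work?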