My problem is slightly different. I need to invoke the model through a SageMaker endpoint. I tried adding the truncation parameter, but it didn't work.
import boto3
import json

client = boto3.client("sagemaker-runtime")

# Request payload: the input text plus the truncation parameter
s = {"inputs": input_sentence, "parameters": {"truncation": True}}
payload = json.dumps(s).encode("utf-8")

content_type = "application/json"
endpoint_name = "huggingface-pytorch-inference-2021-09-21-18-08-15-185"
accept = "application/json"

response = client.invoke_endpoint(
    EndpointName=endpoint_name,
    ContentType=content_type,
    Accept=accept,
    Body=payload,
)
I'm getting the following error with long inputs:
ModelError: An error occurred (ModelError) when calling the InvokeEndpoint operation: Received client error (400) from model with message "{
  "code": 400,
  "type": "InternalServerException",
  "message": "The size of tensor a (577) must match the size of tensor b (512) at non-singleton dimension 1"
}"
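The numbers in the error suggest the input tokenizes to 577 tokens while the model accepts at most 512, so the truncation parameter apparently isn't being applied on the endpoint side. For reference, this is roughly how I'd check the token count locally; the model name below is just a placeholder, not necessarily the model actually deployed behind the endpoint:

# Rough local check of the tokenized length.
# "distilbert-base-uncased" is a placeholder for the deployed model's tokenizer.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")  # placeholder
n_tokens = len(tokenizer(input_sentence)["input_ids"])
print(n_tokens)  # 577 in my case, above the model's 512-token limit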