All the tutorials tend to end at:
predictor.predict({"input": "YOUR_TEXT_GOES_HERE"})
It’s great that the notebooks deliver you to inference, but I have no idea how to hit this endpoint outside of a Jupyter notebook. Using the AWS SDK for Java, I basically have code that does this:
import com.amazonaws.services.sagemakerruntime.AmazonSageMakerRuntime;
import com.amazonaws.services.sagemakerruntime.AmazonSageMakerRuntimeClientBuilder;
import com.amazonaws.services.sagemakerruntime.model.InvokeEndpointRequest;
import com.amazonaws.services.sagemakerruntime.model.InvokeEndpointResult;
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// ...

AmazonSageMakerRuntime runtime = AmazonSageMakerRuntimeClientBuilder.defaultClient();
String body = "{\"instances\": [{\"data\": { \"input\": \"Hello World\"}}]}";
ByteBuffer bodyBuffer = ByteBuffer.wrap(body.getBytes(StandardCharsets.UTF_8));
InvokeEndpointRequest request = new InvokeEndpointRequest()
        .withEndpointName("huggingface-pytorch-training-....")
        .withBody(bodyBuffer);
InvokeEndpointResult invokeEndpointResult = runtime.invokeEndpoint(request);
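In case the payload itself is malformed on my end, here's a minimal, JDK-only sketch I used to sanity-check that the body survives the byte round-trip (the `PayloadCheck` class name is mine, and UTF-8 is an assumption about what the endpoint expects):

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

public class PayloadCheck {
    public static void main(String[] args) {
        // Same JSON body the request sends, with an explicit charset so the
        // bytes on the wire are unambiguous.
        String body = "{\"instances\": [{\"data\": { \"input\": \"Hello World\"}}]}";
        ByteBuffer bodyBuffer = ByteBuffer.wrap(body.getBytes(StandardCharsets.UTF_8));

        // Round-trip the buffer back to a string to confirm nothing is lost
        // in the encoding step. duplicate() keeps the original positions intact.
        String decoded = StandardCharsets.UTF_8.decode(bodyBuffer.duplicate()).toString();
        if (!decoded.equals(body)) {
            throw new AssertionError("encoding mismatch");
        }
        System.out.println("payload ok, " + bodyBuffer.remaining() + " bytes");
    }
}
```

The round-trip succeeds, so the bytes reaching the endpoint should be the literal JSON above.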
Unfortunately, I get an error:
{
  "code": 400,
  "type": "InternalServerException",
  "message": "Content type is not supported by this framework.\n\n Please implement input_fn to to deserialize the request data or an output_fn to\n serialize the response. For more information, see the SageMaker Python SDK README."
}
Am I missing something?