How to quickly change the inferece.py for an endpoint on AWS SagemMaker

Rami · December 2, 2022, 5:48pm

I wondered how to change the inference script of a deploy hugging face model.

marshmellow77 · December 2, 2022, 6:27pm

Unfortunately this is not really possible, at least to my knowledge.

What you could do instead is to use local mode in SageMaker. Instead of deploying the model to a real-time endpoint you can “simulate” the deployment on the local machine (e.g. a Notebook instance). This way you can quickly test the deployment and change it as needed. Once the tests are successful you can deploy to the actual endpoint.

Here is an example how to do that with a HF model. More info on the Github page.

Hope that helps.

Cheers
Heiko

Topic		Replies	Views
SageMaker Inference for Model Tuned Elsewhere Amazon SageMaker	4	1069	September 2, 2021
Unclear documentation using CLIP on Sagemaker for inference Amazon SageMaker	1	1232	May 30, 2023
Loading inference.py separately from model.tar.gz Amazon SageMaker	4	1849	June 5, 2023
Sagemaker multimodel endpoint Amazon SageMaker	1	474	February 2, 2023
How do Inference Endpoints fit into larger solution? Inference Endpoints on the Hub	0	412	June 17, 2023

How to quickly change the inferece.py for an endpoint on AWS SagemMaker

Related topics