Deploy ONNX model to SageMaker

Hi,

I am wondering if anyone has an example of how to deploy an ONNX-converted model to SageMaker.

What is the procedure to create a model artifact for deployment?

And does anything change when deploying the model to SageMaker?

Thanks

Hey @kamneb,

Sadly there is no example for it yet. I hope I can create one soon.

The process would be similar to this example, except that you would need to upload your ONNX model, create a requirements.txt including onnxruntime/optimum, and write an inference.py.
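For the dependencies, a minimal requirements.txt might look like the following (the exact packages and whether to pin versions are assumptions — the post only mentions onnxruntime/optimum):

```
onnxruntime
optimum
```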

Thanks @philschmid. I just changed the path to the ONNX model in the inference.py file and added the dependency in the requirements.txt file. It works well.


@kamneb can you share your requirements.txt and your inference.py?

I'm a little confused how you load the ONNX runtime, as the Hugging Face container expects a transformers AutoModel object.
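For reference, the Hugging Face inference container lets you bypass the default AutoModel loading by defining `model_fn` and `predict_fn` in inference.py. A minimal sketch of what that could look like for an ONNX model — the `model.onnx` filename, the request/response shapes, and loading the tokenizer from the same model dir are all assumptions, not taken from this thread:

```python
# inference.py — sketch of SageMaker inference-toolkit overrides for ONNX.
import os


def model_fn(model_dir):
    """Called once at container start; return value is handed to predict_fn."""
    # Imports kept inside the hook so the module itself can be loaded
    # without onnxruntime/transformers installed locally.
    import onnxruntime as ort
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    session = ort.InferenceSession(os.path.join(model_dir, "model.onnx"))
    return {"tokenizer": tokenizer, "session": session}


def predict_fn(data, model):
    """Tokenize the request payload and run it through the ONNX session."""
    tokenizer, session = model["tokenizer"], model["session"]
    inputs = tokenizer(data["inputs"], return_tensors="np")
    # Feed only the tensors the exported graph actually declares as inputs.
    input_names = {i.name for i in session.get_inputs()}
    feed = {k: v for k, v in inputs.items() if k in input_names}
    outputs = session.run(None, feed)
    return {"logits": outputs[0].tolist()}
```

With these two hooks present in `code/inference.py` inside the model archive, the container never tries to build an AutoModel itself.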

I made a blog post walking through how to do this for a simple case here. Hopefully that can help streamline it for any folks trying to do this in the future : )

@nbertagnolli Where’s the blog? The ‘here’ link does not take us there.

Oh no sorry! Does this one work? Deploy an ONNX Transformer to Sagemaker.