Deploy ONXX model to Sagemaker

Hi,

I am wondering if anyone has an example of how to deploy an ONXX converted model to Sagemaker.

What is the procedure to create a model artifact for deployment?

And is there anything changing when deploying the model to Sagemaker?

Thanks

Hey @kamneb,

Sadly there is no example yet for it. I hope i can create on soon.

The process would be similar to this example. Except that you would need to upload you onnx model and create a requirements.txt including onnxruntime/optimum and write a infernece.py

Thanks @philschmid. I just changed the path to the onxx model in the inference.py file and I add the dependency in the requirements.txt file. it works well.

1 Like