I’m deploying some inference endpoints on SageMaker using the Hugging Face Inference Toolkit, and am overriding some of the default methods (model_fn and predict_fn) as described here:
This works, but testing changes to my inference.py is slow and cumbersome: every iteration requires creating and uploading a new model.tar.gz that bundles the updated script.
Is it possible to provide the inference.py script separately from the compressed model archive?
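For context, a handler script of this shape is what I mean (a minimal sketch; the text-classification task and the handler bodies are illustrative, not my actual code):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer, pipeline

def model_fn(model_dir):
    # Called once at container startup; model_dir is the unpacked model.tar.gz.
    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    model = AutoModelForSequenceClassification.from_pretrained(model_dir)
    return pipeline("text-classification", model=model, tokenizer=tokenizer)

def predict_fn(data, model):
    # Called per request with the deserialized payload and model_fn's return value.
    inputs = data.pop("inputs", data)
    return model(inputs)
```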
It is possible to provide the inference.py script separately if your model.tar.gz is in S3. I’m not sure whether it works with models loaded directly from the Hub.
When you create the HuggingFaceModel() object, pass it source_dir (the local folder containing your inference.py script), entry_point ("inference.py"), and model_data (the S3 URL of your model.tar.gz).
Then the next time you call HuggingFaceModel.deploy(), it will use the inference script from your local folder and the model from S3.
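A minimal sketch of what that looks like, assuming a PyTorch-based model; the bucket name, IAM role, instance type, and framework versions below are placeholders you would swap for your own:

```python
from sagemaker.huggingface import HuggingFaceModel

# model_data points at the existing archive in S3; entry_point/source_dir
# point at the local script, so repackaging model.tar.gz is unnecessary.
huggingface_model = HuggingFaceModel(
    model_data="s3://my-bucket/model/model.tar.gz",  # placeholder S3 URL
    role="arn:aws:iam::111122223333:role/my-sagemaker-role",  # placeholder role
    entry_point="inference.py",
    source_dir="code",  # local folder containing inference.py
    transformers_version="4.26",
    pytorch_version="1.13",
    py_version="py39",
)

predictor = huggingface_model.deploy(
    initial_instance_count=1,
    instance_type="ml.m5.xlarge",  # placeholder instance type
)
```

On each deploy, the SDK packages source_dir and uploads it separately from the model archive, so iterating on inference.py no longer requires rebuilding and re-uploading the full model.tar.gz.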
Where should the inference.py file be inside the model tarball? Should it be inside a “code” subdirectory, or directly at the root of the archive?