Thanks @philschmid for your comment, but that wasn't the problem in this case; the model.tar.gz was fine.
I just found the solution: I needed to pass an extra `env` parameter when creating the model, like so:
```python
env = {'HF_TASK': 'summarization'}

huggingface_model = HuggingFaceModel(
    model_data="s3://my-s3-path/model.tar.gz",
    role=role,
    env=env,
    transformers_version="4.6",
    pytorch_version="1.7",
    py_version='py36',
)
```
This is not in the documentation anywhere for fine-tuned models, but it does appear in the tests for the inference toolkit, in `test_models_from_hub.py` in the aws/sagemaker-huggingface-inference-toolkit repo on GitHub.
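For context, the container reads `HF_TASK` from its environment to decide which `transformers` pipeline to build; with a fine-tuned model, the task can't always be inferred from the Hub, so deployment fails without it. A minimal sketch of that lookup (the helper name `resolve_task` is hypothetical, not the toolkit's actual API):

```python
import os

def resolve_task(default=None):
    # Read the task from the container environment, mirroring how the
    # inference toolkit picks which transformers pipeline to construct.
    task = os.environ.get("HF_TASK", default)
    if task is None:
        raise ValueError(
            "Task could not be inferred; set the HF_TASK environment variable."
        )
    return task

# Setting env={'HF_TASK': 'summarization'} on HuggingFaceModel ends up
# as this environment variable inside the endpoint container.
os.environ["HF_TASK"] = "summarization"
print(resolve_task())  # prints "summarization"
```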