Deploying a conversational pipeline on AWS

philschmid · January 7, 2022, 8:06am

Hey @JB2022,

We added support for the conversational pipeline in a later release. Can you change transformers_version="4.6" to "4.12" and pytorch_version="1.7" to "1.9"?

You can find the whole list of available containers here: Reference

Then your first code snippet should work.

from sagemaker.huggingface import HuggingFaceModel
import sagemaker

role = sagemaker.get_execution_role()
# Hub Model configuration. https://huggingface.co/models
hub = {
	'HF_MODEL_ID':'microsoft/DialoGPT-medium',
	'HF_TASK':'conversational'
}

# create Hugging Face Model Class
huggingface_model = HuggingFaceModel(
	transformers_version='4.12',
	pytorch_version='1.9',
	py_version='py38', # the 4.12 / 1.9 containers should ship with Python 3.8; verify against the container list
	env=hub,
	role=role, 
)

# deploy model to SageMaker Inference
predictor = huggingface_model.deploy(
	initial_instance_count=1, # number of instances
	instance_type='ml.m5.xlarge' # ec2 instance type
)

# send the conversation history plus the new user message
predictor.predict({
	'inputs': {
		"past_user_inputs": ["Which movie is the best ?"],
		"generated_responses": ["It's Die Hard for sure."],
		"text": "Can you explain why ?",
	}
})
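
If you want to keep a conversation going over several requests, the history has to be sent again with every call. Here is a minimal sketch of how that payload could be assembled. The helper names build_conversational_payload and record_turn are made up for illustration and are not part of the SageMaker SDK. How you pull the reply text out of the endpoint response is left to you, since the response format is not shown in this thread.

```python
from typing import Dict, List, Optional, Tuple


def build_conversational_payload(
    text: str,
    past_user_inputs: Optional[List[str]] = None,
    generated_responses: Optional[List[str]] = None,
) -> Dict[str, Dict[str, object]]:
    """Build a request body in the same shape as the predict() call above.

    Hypothetical helper: it only arranges the three keys used in the
    snippet ('past_user_inputs', 'generated_responses', 'text').
    """
    return {
        "inputs": {
            "past_user_inputs": list(past_user_inputs or []),
            "generated_responses": list(generated_responses or []),
            "text": text,
        }
    }


def record_turn(
    payload: Dict[str, Dict[str, object]], reply: str
) -> Tuple[List[str], List[str]]:
    """Return the updated history after one exchange.

    'reply' is whatever text you extracted from the endpoint response.
    """
    inputs = payload["inputs"]
    past_user_inputs = list(inputs["past_user_inputs"]) + [inputs["text"]]
    generated_responses = list(inputs["generated_responses"]) + [reply]
    return past_user_inputs, generated_responses


# Same request as in the snippet above, built with the helper.
payload = build_conversational_payload(
    "Can you explain why ?",
    past_user_inputs=["Which movie is the best ?"],
    generated_responses=["It's Die Hard for sure."],
)
```

You would then pass payload to predictor.predict(payload), and feed the two lists returned by record_turn into the next call to build_conversational_payload.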
