Save and deploy distilbert model in AWS SageMaker

gopalkr272 · April 6, 2021, 1:23pm

I am trying to download the Hugging Face distilbert model, trying to save to S3. The model itself does not have a deploy method. So I am saving to S3, instantiating it and trying to deploy. May I know if this will work with Sagemaker. What I am doing wrong. Here are the steps:

model_name = ‘distilbert-base-uncased-distilled-squad’ model = DistilBertForQuestionAnswering.from_pretrained(model_name) tokenizer = DistilBertTokenizerFast.from_pretrained(model_name)

the below works and gives output context = “xxxx” question = “yyy?”

nlp = pipeline(‘question-answering’, model=model, tokenizer=tokenizer)

nlp({ ‘question’: ‘What organization is the IPCC a part of?’, ‘context’: context })

save the model to local folder model.save_pretrained(’./scripts/mymodel’)

zip the model file. with tarfile.open("./scripts/mymodel/model.tar.gz", “w:gz”) as tar: tar.add("./scripts/mymodel/pytorch_model.bin") tar.add("./scripts/mymodel/config.json")

upload the zipped file to S3 sagemaker.Session().upload_data(bucket=sagemaker_session_bucket, path=’./scripts/mymodel/model.tar.gz’, key_prefix=‘model’)

instantiating the saved model

bertmodel = PyTorchModel(entry_point=‘inference.py’, source_dir=‘scripts’, model_data=‘s3://’+sagemaker_session_bucket+’/model/model.tar.gz’, role=sagemaker.get_execution_role(), framework_version=‘1.5’, py_version=‘py3’)

the below does not work nlp = pipeline(‘question-answering’, model=bertmodel, tokenizer=tokenizer)

nlp({ ‘question’: ‘What organization is the IPCC a part of?’, ‘context’: context })

error recd - AttributeError: ‘PyTorchModel’ object has no attribute ‘config’

I am able to deploy the predictor predictor = bertmodel.deploy(initial_instance_count=1, instance_type=‘ml.m5.xlarge’)

g3casey · April 8, 2021, 2:47am

@gopalkr272 I am afraid I don’t have a complete answer but review my support ticket here:
https://github.com/huggingface/transformers/issues/11043
I hope this will help you find an answer.

OlivierCR · April 9, 2021, 8:06am

There are couple public samples showing Hugging Face model deployment on SageMaker, for example those:

amazon-sagemaker-deploy-nlp-huggingface/deploy_pretrained_BART_seq2seq_PyTorch.ipynb at main · aws-samples/amazon-sagemaker-deploy-nlp-huggingface · GitHub
Serving PyTorch models in production with the Amazon SageMaker native TorchServe integration | AWS Machine Learning Blog
GitHub - aws-samples/amazon-sagemaker-bert-classify-pytorch: This sample show you how to train BERT on Amazon Sagemaker using Spot instances

Topic		Replies	Views
Use my finetuned Bert Model in SageMaker BatchTransform Amazon SageMaker	4	2975	April 30, 2022
Deploying HG Pipelines on AWS Sagemaker Amazon SageMaker	4	1839	January 17, 2022
InternalServerException when running a model loaded on S3 Amazon SageMaker	4	984	August 6, 2021
Create a batch transform job with custom trained biobert model Amazon SageMaker	15	2045	February 22, 2022
Error deploying BERT on SageMaker Amazon SageMaker	20	5286	April 1, 2025

Save and deploy distilbert model in AWS SageMaker

Related topics