This category is for any questions related to using Hugging Face Transformers with Amazon SageMaker. Don’t forget to check the announcement blogpost for more resources.
Thanks for this amazing project. Hugging Face and SageMaker are both leaders in their respective domains, and integrating them will definitely enhance their effectiveness.
Is it currently possible to deploy real-time endpoints with SageMaker using Hugging Face?
thank you for the feedback
We are currently working on a nice way to deploy all of the Hugging Face models to SageMaker, but this will still take a little time. In the meantime, you could use the SageMaker PyTorch inference toolkit.
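In case it helps, here is a rough sketch of what deploying with the PyTorch inference toolkit can look like via the SageMaker Python SDK. All the concrete values below (S3 path, IAM role, entry-point script, framework version, instance type) are placeholders for your own setup, so treat this as an outline rather than a copy-paste recipe:

```python
from sagemaker.pytorch import PyTorchModel

# Placeholder values: swap in your own model archive, role and script.
model = PyTorchModel(
    model_data="s3://my-bucket/model/model.tar.gz",  # tarball with the HF model files + inference.py
    role="arn:aws:iam::123456789012:role/MySageMakerRole",
    entry_point="inference.py",   # implements model_fn / input_fn / predict_fn for the toolkit
    framework_version="1.6.0",    # pick the PyTorch version matching your model
    py_version="py3",
)

# Spins up a real-time endpoint (this provisions AWS resources and incurs cost)
predictor = model.deploy(initial_instance_count=1, instance_type="ml.m5.xlarge")
# predictor.predict({"inputs": "some document text"})
```

The entry-point script is where you would load the tokenizer and model and run the actual prediction.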
Thanks for responding.
Are there any example notebooks available for deploying?
And, I will be very grateful if you can guide me about this:
I am trying to build a custom document classifier with Hugging Face, but my client is currently using Amazon Comprehend. Is it possible to come up with a better classifier than Comprehend, given that we have little data (i.e. 50 examples per class, 20 classes in total)?
Hi @m-ali-awan, thanks a lot for reaching out! You can find HF deployment examples here:
- GitHub - aws-samples/amazon-sagemaker-bert-classify-pytorch: this sample shows you how to train BERT on Amazon SageMaker using Spot instances (the training part uses the PyTorch container but should work fine in the newer HF container as well)
- Serving PyTorch models in production with the Amazon SageMaker native TorchServe integration | AWS Machine Learning Blog
- Fine-tuning a PyTorch BERT model and deploying it with Amazon Elastic Inference on Amazon SageMaker | AWS Machine Learning Blog
Note that the SageMaker hosting experience varies depending on your version of PyTorch (MMS backend or TorchServe backend); see Use PyTorch with the SageMaker Python SDK — sagemaker 2.39.1 documentation.
It’s not possible to tell you whether HF or Comprehend will give you better results, because half the answer is in the developer’s hands: it depends on the data, the model, and its training (epochs, optimizers…). Using your own Hugging Face code in SageMaker, you will indeed have more freedom from a science and system-architecture standpoint (free to inspect the model, export it out of AWS, play with its weights, test various backends and tasks, etc.), but be aware that with more freedom comes more responsibility: model science and infrastructure become your ownership. In Comprehend, more things are managed, with a concession on development freedom.
Another option would be to upload your fine-tuned model to the Hugging Face Hub as either a private or a public model and then use it with the Accelerated Inference API. You can test the API for free or go with a plan that fits you and your customer. It is comparable to Comprehend in that it is managed, but it is easier to provide a custom model and benefit from the accelerations and optimizations the Accelerated Inference API is doing.
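For reference, calling the Accelerated Inference API boils down to a single authenticated HTTP POST. A minimal sketch, where the model id `my-org/my-doc-classifier` and the token are hypothetical placeholders:

```python
import json

API_URL = "https://api-inference.huggingface.co/models/{model_id}"

def build_request(model_id, text, api_token):
    """Build the URL, headers and JSON body for an Inference API call."""
    url = API_URL.format(model_id=model_id)
    headers = {"Authorization": f"Bearer {api_token}"}  # your Hub API token
    body = json.dumps({"inputs": text})
    return url, headers, body

# Actually sending the request needs a valid Hub token, so it is shown commented out:
# import requests
# url, headers, body = build_request("my-org/my-doc-classifier", "Invoice #1234 ...", "hf_xxx")
# print(requests.post(url, headers=headers, data=body).json())
```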
Thanks for this option.
So, currently we are using Textract for OCR, and we want to feed the .txt outputs from this pipeline into document classification. Can we integrate this API into AWS Lambda, or what should be the way to go?
Yes, you could create a Python AWS Lambda function to read the .txt files, split them into passages depending on how big the documents are, and then send the passages to the Inference API. You can find the documentation here.
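As a sketch of what that Lambda could do before calling the API, here is a simple passage splitter in pure Python. The character budget per passage is an assumption you would tune to your model and documents, and the API call itself is omitted since it needs a token:

```python
def split_into_passages(text, max_chars=2000):
    """Split Textract text output into passages of at most max_chars,
    preferring paragraph boundaries (blank lines) where possible."""
    passages, current = [], ""
    for para in text.split("\n\n"):
        para = para.strip()
        if not para:
            continue
        # Paragraphs longer than the budget are emitted in hard slices.
        while len(para) > max_chars:
            if current:
                passages.append(current)
                current = ""
            passages.append(para[:max_chars])
            para = para[max_chars:]
        if not para:
            continue
        # Flush the running passage if adding this paragraph would overflow it.
        if current and len(current) + len(para) + 2 > max_chars:
            passages.append(current)
            current = ""
        current = f"{current}\n\n{para}" if current else para
    if current:
        passages.append(current)
    return passages

# In the Lambda handler you would read the .txt from the S3 event with boto3,
# call split_into_passages, and POST each passage to the Inference API.
```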
Thanks. So, is there any limit on the number of characters in a document, and if so, how can we cater for (increase) it?
The token limit depends on the model you use. For example, bert-base-cased has a max_length of 512, the same as many similar models.
Ok, thanks. And when we are training for custom classes, can we simply increase these?
And there will definitely be some memory limitations, so how can we cater for this at inference time for relatively large documents?
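One common way to handle documents longer than the model's max_length is to split the tokenized input into overlapping windows, classify each window separately, and aggregate the per-window scores. A minimal sketch of that idea in plain Python; the window size, stride, and score-averaging strategy are assumptions, not the only options:

```python
def sliding_windows(token_ids, max_len=512, stride=128):
    """Split a token id sequence into windows of at most max_len,
    overlapping by `stride` tokens so no context is cut abruptly."""
    if len(token_ids) <= max_len:
        return [token_ids]
    windows, start = [], 0
    while start < len(token_ids):
        windows.append(token_ids[start:start + max_len])
        if start + max_len >= len(token_ids):
            break
        start += max_len - stride
    return windows

def aggregate_scores(per_window_scores):
    """Average per-class scores across windows and pick the top class.
    Each element is a dict mapping class name -> score for one window."""
    n = len(per_window_scores)
    classes = per_window_scores[0].keys()
    avg = {c: sum(s[c] for s in per_window_scores) / n for c in classes}
    return max(avg, key=avg.get), avg
```

Each window stays within the model's memory budget, so even very large documents can be classified one chunk at a time.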
Thanks a lot, @philschmid,
I have already bothered you a lot, but sorry, I am not experienced enough with Hugging Face.
So, can we train this Longformer, for custom case?
Hi @philschmid, hope you are fine.
Do we have support for fine-tuning relation-extraction models with SageMaker?
If yes, kindly share any relevant notebook link.
Thanks a lot…
Please share with me any resources (Colab notebooks) related to relation extraction.
@m-ali-awan, you can find all the examples we currently have in these 4 different resources:
Thanks a lot, but do they contain examples for relation extraction?
Or any new state-of-the-art NER?
@m-ali-awan yes, the community notebooks, as well as the example scripts, include examples for named-entity recognition.
On the Hub, we have ~800 models trained for token classification. Take a look and see if one of these fits your use case: Hugging Face – The AI community building the future.