About the Amazon SageMaker category

Hi @philschmid, hope you are fine.
I have a huge model: it requires 50 GB of system RAM, and after loading it onto a GPU it requires 14 GB of GPU RAM. We will most likely go with g4dn.12xlarge, which has 4 GPUs; we will deploy one model replica per GPU and expose a single API to interact with them.
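The one-replica-per-GPU, single-API layout can be sketched as a round-robin dispatcher in plain Python. `MultiGpuDispatcher` and the stand-in callables below are hypothetical names, just to show the routing; real replicas would be the model moved to `cuda:0` … `cuda:3`:

```python
import itertools
from concurrent.futures import ThreadPoolExecutor


class MultiGpuDispatcher:
    """Route requests from one API across one model replica per GPU."""

    def __init__(self, replicas):
        # replicas: one callable per GPU, e.g. a model already moved to cuda:i
        self._replicas = list(replicas)
        self._next_idx = itertools.cycle(range(len(self._replicas)))
        # one worker per replica so the GPUs can serve concurrently
        self._pool = ThreadPoolExecutor(max_workers=len(self._replicas))

    def predict(self, request):
        idx = next(self._next_idx)  # round-robin choice of replica
        return self._pool.submit(self._replicas[idx], request).result()


# stand-in "replicas" instead of real GPT-J copies on 4 GPUs
dispatcher = MultiGpuDispatcher([lambda r, i=i: f"gpu{i}:{r}" for i in range(4)])
print([dispatcher.predict("hello") for _ in range(5)])
# ['gpu0:hello', 'gpu1:hello', 'gpu2:hello', 'gpu3:hello', 'gpu0:hello']
```

A production setup would put this behind a real HTTP server and add batching, but the routing idea is the same.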
So, is it possible to import a Hugging Face pretrained model into SageMaker and then deploy it?

Thanks for any help.

Hey @m-ali-awan,

We are working hard on an inference solution for Hugging Face on SageMaker to make deploying models as easy as possible. The current estimate for release is early July, so if you can wait a couple of weeks we will have a nice solution for you. If you cannot wait, you can use the plain PyTorch implementation to deploy your model, but it is considerably more complicated.
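For reference, the plain-PyTorch route means writing your own inference script. Below is a minimal sketch of such a script, assuming the model is loadable with `transformers`; the `model_fn`/`input_fn`/`predict_fn`/`output_fn` hook names are the ones the SageMaker PyTorch serving container looks for, but the loading and generation details here are illustrative assumptions, not a tested deployment:

```python
# inference.py -- entry-point script for a SageMaker PyTorch endpoint.
import json


def model_fn(model_dir):
    # Called once per worker to load the model; requires torch and
    # transformers to be installed in the serving container.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    model = AutoModelForCausalLM.from_pretrained(model_dir).to("cuda").eval()
    return model, tokenizer


def input_fn(request_body, content_type="application/json"):
    # Deserialize the incoming request.
    if content_type != "application/json":
        raise ValueError(f"Unsupported content type: {content_type}")
    return json.loads(request_body)["inputs"]


def predict_fn(text, model_and_tokenizer):
    # Run generation on the loaded model.
    model, tokenizer = model_and_tokenizer
    ids = tokenizer(text, return_tensors="pt").input_ids.to(model.device)
    out = model.generate(ids, max_length=64)
    return tokenizer.decode(out[0], skip_special_tokens=True)


def output_fn(prediction, accept="application/json"):
    # Serialize the response.
    return json.dumps({"generated_text": prediction})
```

You would then package the model artifacts together with this script and deploy via the SageMaker Python SDK's `PyTorchModel`; that packaging step is where most of the extra custom work comes in.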

Hi @philschmid, thanks a lot; you always respond to my queries. :relaxed:

It would be great if you could share the link to the PyTorch deployment.

Further, my model is GPT-J, converted from JAX to PyTorch. As you know, deployment on AWS Inferentia chips is cheaper and more efficient.
So, is it possible to compile GPT-J with the Neuron SDK? There is an example for Hugging Face BERT, but I am not sure about GPT, since Neuron compilation does not work for all kinds of models.

Thanks again for all your help.

Regarding Inferentia, I am not sure whether compiling GPT-J works; you would need to test it. You also need to be careful to use the right configuration when compiling with Neuron.

Here is a PyTorch example that requires a lot of custom code: GitHub - aws-samples/amazon-sagemaker-bert-pytorch.

Thanks a lot.

Hi @philschmid, hope you are fine.
Is there a Discord channel for Hugging Face?
If so, kindly share the invite.
Thanks!