SageMaker not able to download the EleutherAI/gpt-j-6B model from the Hugging Face Hub on startup

Trying to create a basic inference endpoint on SageMaker, but I can see that the model downloads to 30-40% and then the download restarts, and this loop keeps repeating. After 20-30 minutes it just fails.

Tried with different instance types as well, but the same issue persists.

Any help would be really appreciated.

Here is the exact code:

import sagemaker
from sagemaker.huggingface import HuggingFaceModel

role = sagemaker.get_execution_role()

# Hub model configuration
hub = {
    'HF_MODEL_ID': 'EleutherAI/gpt-j-6B',
    'HF_TASK': 'text-generation'
}

# create Hugging Face Model Class
huggingface_model = HuggingFaceModel(
    transformers_version='4.6.1',
    pytorch_version='1.7.1',
    py_version='py36',
    env=hub,
    role=role,
)

# deploy model to SageMaker Inference
predictor = huggingface_model.deploy(
    initial_instance_count=1,       # number of instances
    instance_type='ml.m5.4xlarge'   # EC2 instance type
)

predictor.predict({
    'inputs': "Can you please let us know more details about your "
})

Pinging @philschmid here 🙂

Hey @amangrk,

Sorry for missing your post.
Sadly, only models < 10 GB are supported for direct loading from the Hub via the configuration. In addition, I can tell you that even if you moved GPT-J 6B to S3 and then tried to deploy it, SageMaker would time out, unless you went with the P4 instances, but they are very expensive.
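For the S3 route, the weights need to be packed into a `model.tar.gz` with the model files at the archive root, which you then upload to S3 and pass as `model_data` to `HuggingFaceModel` instead of the `hub` env. A minimal sketch of the packaging step, assuming you already saved the model locally to a directory like `./gpt-j-6B` (hypothetical path):

```python
import os
import tarfile

def package_model(model_dir: str, archive_path: str = "model.tar.gz") -> str:
    """Pack the contents of model_dir into a gzipped tarball with the
    files at the archive root, which is the layout SageMaker expects."""
    with tarfile.open(archive_path, "w:gz") as tar:
        for name in os.listdir(model_dir):
            tar.add(os.path.join(model_dir, name), arcname=name)
    return archive_path

# After packaging, upload the archive to S3 (e.g. with
# sagemaker.s3.S3Uploader.upload) and pass the resulting S3 URI
# as model_data=... when constructing HuggingFaceModel.
```

Note that for a 6B-parameter model the archive is still tens of gigabytes, so the endpoint startup time (and the timeout issue mentioned above) remains the real bottleneck.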

I have already shared this with the AWS team and they are looking into it.

Any updates on this, @philschmid? Running into the same issue with the bigscience/tp00 model.

We created an example of how to deploy GPT-J 6B; maybe it can help you: Deploy GPT-J 6B for inference using Hugging Face Transformers and Amazon SageMaker
We are also working on new approaches to model parallelism which might make things easier.