I tried to deploy a gated model (7B parameters, about 14 GB) to a SageMaker endpoint on an ml.g5.2xlarge instance. I have been granted access to the model, and I am using the same deployment code that Hugging Face provides for Amazon SageMaker. The deployment fails with an UnexpectedStatusException, and the logs show this message: "It is a gated repo. You must be authenticated to use it."
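One possible cause, assuming the deployment uses the Hugging Face LLM (TGI) container as in the standard snippet, is that the endpoint container never receives a Hub access token. Having access on your account is not enough, because the container downloads the model on its own. The sketch below passes the token through the `HUGGING_FACE_HUB_TOKEN` environment variable. The helper names `build_hub_env` and `deploy`, the model ID, the role, and the health check timeout are placeholders for illustration, not values from the original post.

```python
import json


def build_hub_env(model_id: str, hf_token: str, num_gpus: int = 1) -> dict:
    """Build the container environment for a gated Hub model (hypothetical helper)."""
    if not hf_token:
        raise ValueError("A Hugging Face access token is required for gated repos.")
    return {
        "HF_MODEL_ID": model_id,
        "SM_NUM_GPUS": json.dumps(num_gpus),
        # Token of an account that has been granted access to the gated repo.
        "HUGGING_FACE_HUB_TOKEN": hf_token,
    }


def deploy(model_id: str, hf_token: str, role: str):
    """Deploy to a SageMaker endpoint (hypothetical helper, needs AWS credentials)."""
    # Imported here so build_hub_env can be used without the sagemaker package.
    from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

    model = HuggingFaceModel(
        image_uri=get_huggingface_llm_image_uri("huggingface"),
        env=build_hub_env(model_id, hf_token),
        role=role,
    )
    return model.deploy(
        initial_instance_count=1,
        instance_type="ml.g5.2xlarge",
        # Assumed value: large models can take several minutes to download and load.
        container_startup_health_check_timeout=300,
    )
```

Calling `deploy("<your-gated-model-id>", "<your-hf-token>", "<your-sagemaker-role-arn>")` would then start the endpoint with the token available inside the container. If the same error persists, it may be worth checking that the token belongs to the account that accepted the model's license.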