Using S3 as a model cache for the Hugging Face LLM inference DLC on SageMaker

We released a blog post on how to do this: Securely deploy LLMs inside VPCs with Hugging Face and Amazon SageMaker
