Deploying a custom inference script with a fine-tuned Llama 2 model

As suggested, I have provided the following, without any modifications to the above config:

from sagemaker.huggingface import HuggingFaceModel

hf_model = HuggingFaceModel(
    model_data=s3_model_uri,        # S3 URI of the model.tar.gz with the fine-tuned artifacts
    role=role,                      # SageMaker execution role
    env=config,                     # env vars read by the inference toolkit (HF_MODEL_ID, HF_TASK, ...)
    transformers_version="4.28.1",
    pytorch_version="2.0.0",
    py_version="py310",
)
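
For reference, the endpoint is then deployed roughly like this (the instance type below is illustrative, not necessarily what fits a Llama 2 fine-tune):

predictor = hf_model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.2xlarge",  # illustrative; pick a GPU instance sized for the model
)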

I get the following error in the logs:

huggingface_hub.utils._validators.HFValidationError: Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/opt/ml/model'. Use `repo_type` argument if needed.

I understand that I should change HF_MODEL_ID, but if I give it the repo_name from the Hugging Face Hub, will it still consider the artifacts in the tar.gz file?
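
To make the question concrete, here is roughly the distinction I mean (a minimal sketch; the repo name and HF_TASK value are assumptions, not my actual config):

# Option A: point HF_MODEL_ID at a Hub repo (repo name here is illustrative)
config = {
    "HF_MODEL_ID": "meta-llama/Llama-2-7b-hf",
    "HF_TASK": "text-generation",
}

# Option B: omit HF_MODEL_ID entirely; my understanding is that the inference
# toolkit then loads the artifacts that model_data extracted into /opt/ml/model
config = {
    "HF_TASK": "text-generation",
}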
