I have a Mistral Instruct v2 model that has been fine-tuned with SFT followed by DPO. I want to deploy it on SageMaker with a custom inference script, and while doing this I also want to stream the output via SageMaker. Is this possible? I have seen a comment from last November saying that one cannot use the LLM container with a custom inference script (Deploying custom inference script with llama2 finetuned model - #2 by philschmid).
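For concreteness, here is a rough sketch of what I am trying to end up with, using the HuggingFaceModel class from the sagemaker SDK and boto3's invoke_endpoint_with_response_stream. The S3 path, role ARN, endpoint name, and container versions below are just placeholders, and I understand the server-side inference.py would also need to emit a streamed response for the client loop to actually see tokens as they are generated:

```python
import json

import boto3
from sagemaker.huggingface import HuggingFaceModel

# Deploy the fine-tuned model with a custom inference script.
# The model.tar.gz is assumed to contain the weights plus code/inference.py
# (model_fn / predict_fn overrides); bucket, role, and versions are placeholders.
model = HuggingFaceModel(
    model_data="s3://my-bucket/mistral-dpo/model.tar.gz",
    role="arn:aws:iam::123456789012:role/MySageMakerRole",
    transformers_version="4.37",
    pytorch_version="2.1",
    py_version="py310",
)
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.2xlarge",
    endpoint_name="mistral-dpo-endpoint",
)

# Stream tokens back from the endpoint. This API does exist on the
# sagemaker-runtime client, but the custom inference script on the
# server side would also have to produce a streamed response.
smr = boto3.client("sagemaker-runtime")
response = smr.invoke_endpoint_with_response_stream(
    EndpointName="mistral-dpo-endpoint",
    ContentType="application/json",
    Body=json.dumps({"inputs": "Hello!", "parameters": {"max_new_tokens": 256}}),
)
for event in response["Body"]:
    part = event.get("PayloadPart", {}).get("Bytes", b"")
    if part:
        print(part.decode("utf-8"), end="", flush=True)
```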
Any idea if we can do this now? Any help would be appreciated.