Hey @RemiP, thanks for your response. Could you please elaborate on how you are streaming outputs from the LLM deployed as a Hugging Face inference endpoint? Appreciate your help :)