How to Query the Progress of Inference on a Custom Endpoint Handler?

isatis · August 19, 2023, 6:15am

Hello Hugging Face community,

I’m currently working with a custom handler for my inference pipeline, and I’m trying to understand how I can query the progress of the inference from an endpoint.

Below is the method I’m currently using:


def __call__(self, data: Any) -> List[List[Dict[str, float]]]:
    """
    Args:
        data (:obj:):
            includes the input data and the parameters for the inference.
    Return:
        A :obj:`dict`:. base64 encoded image
    """
    inputs = data.pop("inputs", data)
    
    # run inference pipeline
    with autocast(device.type):
        image = self.pipe(inputs, guidance_scale=7.5)["sample"][0]  
        
    # encode image as base 64
    buffered = BytesIO()
    image.save(buffered, format="JPEG")
    img_str = base64.b64encode(buffered.getvalue())

    # postprocess the prediction
    return {"image": img_str.decode()}

While this works to get the result, it doesn’t provide any insights into how far the inference has progressed.

My questions are:

How can I modify the above __call__ method to provide updates or feedback about the inference progress?
How do I subsequently query the endpoint to get this progress information?

Any help, sample code, or pointers would be greatly appreciated!

Thank you in advance!

hbredin · November 10, 2023, 10:35am

Did you find a solution?
That would be very handy!

Topic		Replies	Views
Custom Inference handler.py: FileNotFoundError Inference Endpoints on the Hub	8	814	April 8, 2024
Help with custom handler.py for model inference endpoint Beginners	1	734	February 24, 2024
No custom pipeline found at /repository/handler.py Inference Endpoints on the Hub	4	719	April 3, 2023
Is it possible to have streaming responses from inference endpoints? Inference Endpoints on the Hub	6	2087	July 24, 2023
How to have custom output size for inference API Inference Endpoints on the Hub	4	1315	February 16, 2023

How to Query the Progress of Inference on a Custom Endpoint Handler?

Related topics