SageMaker Serverless Inference for LayoutLMv2 model

Hi Elman, thanks for opening this thread! This is a super interesting topic :slight_smile:

No matter which model you deploy to a SageMaker (SM) endpoint, the input always requires preprocessing before it can be passed to the model. The reason you can just pass some text in the case of DistilBERT without having to do the processing yourself is that the SageMaker Hugging Face Inference Toolkit does all that work for you. This toolkit builds on top of the Pipeline API, which is what makes it so easy to call.
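To make that concrete, here is roughly what the "it just works" path looks like for a plain text model on a serverless endpoint. This is only a minimal sketch: the model ID, DLC versions, role placeholder and memory size are assumptions you would adapt to your own setup.

```python
from sagemaker.huggingface import HuggingFaceModel
from sagemaker.serverless import ServerlessInferenceConfig

# Minimal sketch: deploy a text-classification model from the Hub to a
# serverless endpoint. Model ID, versions and memory size are placeholder
# assumptions -- adjust them to your setup.
huggingface_model = HuggingFaceModel(
    env={
        "HF_MODEL_ID": "distilbert-base-uncased-finetuned-sst-2-english",
        "HF_TASK": "text-classification",
    },
    role="<your-sagemaker-execution-role>",
    transformers_version="4.17",
    pytorch_version="1.10",
    py_version="py38",
)

predictor = huggingface_model.deploy(
    serverless_inference_config=ServerlessInferenceConfig(
        memory_size_in_mb=6144,
        max_concurrency=1,
    ),
)

# Because the toolkit wraps the Pipeline API, plain text "just works":
print(predictor.predict({"inputs": "I love this!"}))
```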

What does that mean for you when you want to use a LayoutLMv2 model? I see two possibilities:

  1. The Pipeline API offers a class for object detection: Pipelines. I’m not familiar with it, but I would imagine it is quite straightforward to use. Again, because the Inference Toolkit is based on Pipelines, once you figure out how to use the Pipeline API for object detection, you can use the same call against the SM endpoint (see the first sketch after this list).

  2. The Inference Toolkit also allows you to provide your own preprocessing script; see more details here: Deploy models to Amazon SageMaker. That means you can process the inputs yourself before passing them to the model. What I would do (because I’m lazy) is look at an already existing demo to see how the preprocessing for a LayoutLMv2 model works, for example this one: app.py · nielsr/LayoutLMv2-FUNSD at main, and use that (see the second sketch below).
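For option 1, the idea is to get the pipeline call working locally first, since the endpoint payload mirrors the pipeline input. A minimal sketch of what that looks like; the checkpoint here is a generic object-detection model used as a placeholder, so do verify that your LayoutLMv2 checkpoint is actually supported by this pipeline:

```python
from transformers import pipeline

# Minimal sketch of the object-detection pipeline; the checkpoint is a
# placeholder -- verify that your LayoutLMv2 checkpoint is supported.
detector = pipeline("object-detection", model="facebook/detr-resnet-50")

# The pipeline accepts an image URL (or a PIL image / local path):
results = detector("http://images.cocodataset.org/val2017/000000039769.jpg")
print(results)  # list of dicts with "label", "score" and "box"
```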

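For option 2, a custom script usually means overriding `model_fn` and `predict_fn` in an `inference.py`. The sketch below is loosely adapted from the preprocessing in the FUNSD demo linked above; the request format (raw image bytes under "inputs"), the base processor checkpoint and the response shape are assumptions, not tested code, and you may also need an `input_fn` depending on how your client serializes the image:

```python
# inference.py -- custom handler for the SageMaker HF Inference Toolkit.
# Loosely adapted from the LayoutLMv2-FUNSD demo; the request/response
# formats here are assumptions you would adapt to your own client.
import io

import torch
from PIL import Image
from transformers import LayoutLMv2ForTokenClassification, LayoutLMv2Processor


def model_fn(model_dir):
    # The processor runs OCR (via Tesseract) and prepares all model inputs:
    # token ids, bounding boxes and the resized image.
    processor = LayoutLMv2Processor.from_pretrained("microsoft/layoutlmv2-base-uncased")
    model = LayoutLMv2ForTokenClassification.from_pretrained(model_dir)
    return model, processor


def predict_fn(data, model_and_processor):
    model, processor = model_and_processor

    # Assumption: the request body carries the raw image bytes under "inputs".
    image = Image.open(io.BytesIO(data["inputs"])).convert("RGB")

    encoding = processor(image, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**encoding)

    # One predicted label id per token; map ids back via model.config.id2label.
    predictions = outputs.logits.argmax(-1).squeeze().tolist()
    labels = [model.config.id2label[p] for p in predictions]
    return {"labels": labels}
```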
Hope this helps! Please let me know how it goes and/or reach out if you have any questions.

Cheers
Heiko
