Get the Q&A in LayoutLMv2 in text form

paramdeep · February 3, 2022, 3:59pm

I am going through the LayoutLMv2 inference tutorial by @nielsr. It is a great tutorial for understanding how LayoutLM works.

The final output of the model is a set of bounding boxes drawn on top of the invoice image. I would like to get the Question & Answer pairs as text output instead (JSON if possible). My questions are:

How can I get the Question and Answer text (instead of the bounding boxes)?
How do I cluster the words that belong to one Question/Answer? (Right now there can be multiple bounding boxes for a single Question/Answer.)

Any help in this regard would be appreciated.

nielsr · February 7, 2022, 8:17pm

Hi,

We plan to add LayoutLMv2ForRelationExtraction (that allows you to do just that) to the library. See here to follow the progress (it also includes a link to a Colab notebook).
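
In the meantime, the word-clustering part of the question can be sketched without any model changes. The snippet below is only a minimal illustration, not code from the tutorial: it assumes you have already reduced the model output to one BIO label per word (for example B-QUESTION, I-QUESTION, B-ANSWER, as in FUNSD-style label sets) together with the OCR words and their boxes. The function name `group_entities` and the sample data are hypothetical.

```python
import json


def group_entities(words, labels, boxes):
    """Merge consecutive word-level BIO predictions into text spans.

    Assumes one label per word, formatted as "B-<TYPE>", "I-<TYPE>" or "O",
    and one [x0, y0, x1, y1] box per word.
    """
    entities = []
    current = None
    for word, label, box in zip(words, labels, boxes):
        if label == "O":
            current = None  # an "O" word closes any open span
            continue
        prefix, _, kind = label.partition("-")
        # Start a new span on "B-", or when an "I-" has nothing to continue.
        if prefix == "B" or current is None or current["label"] != kind:
            current = {"label": kind, "words": [word], "box": list(box)}
            entities.append(current)
        else:
            current["words"].append(word)
            x0, y0, x1, y1 = current["box"]
            # Grow the span box to the union of the word boxes.
            current["box"] = [min(x0, box[0]), min(y0, box[1]),
                              max(x1, box[2]), max(y1, box[3])]
    return [{"label": e["label"], "text": " ".join(e["words"]), "box": e["box"]}
            for e in entities]


# Made-up sample data standing in for OCR words and per-word predictions.
words = ["Invoice", "No:", "12345", "Date:", "2022-02-03"]
labels = ["B-QUESTION", "I-QUESTION", "B-ANSWER", "B-QUESTION", "B-ANSWER"]
boxes = [[10, 10, 60, 20], [62, 10, 80, 20], [90, 10, 130, 20],
         [10, 30, 40, 40], [50, 30, 110, 40]]

entities = group_entities(words, labels, boxes)
print(json.dumps(entities, indent=2))
```

This gives one text span and one merged box per Question or Answer. It does not decide which Answer belongs to which Question; that linking step is what relation extraction is meant to handle.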
