Dataset preparation for LayoutLM and LiLT

rushabhGod · January 14, 2025, 4:22am

I am working on an OCR project to perform document understanding on a specific type of document with a fixed layout (e.g., mandate forms). The layout has minor variations across images. My goal is to accurately extract key-value pairs and output them in JSON format.

Now for dataset preparation, i have samples of the document on which i want to perform OCR and extract text accurately. Now, when i am annotating the dataset, do I have to annotate them as keys and value generally like all keys as “key” and all values as “values” or each key and values specifically Date_key, Date_value, AccountNo_key, Account_Value, etc?

Which will be the best practice to annotate my dataset to train it on LayoutLM and LiLT model.?

for16 · April 27, 2025, 11:01pm

Hello, may I ask what OCR did you use for this project?

Topic		Replies	Views
Improving Key-Value Pair Extraction with LayoutLM and LiLT on Custom OCR Dataset Research	2	276	February 21, 2025
Looking for OCR post-processing for Visual Document Understanding Research	0	640	December 15, 2023
Which model to select Models	1	70	April 14, 2025
Image Token classification LayoutLMv3 Beginners	0	354	November 7, 2023
LayoutLM model annotation regarding Beginners	2	1040	May 3, 2023

Dataset preparation for LayoutLM and LiLT

Related topics