LayoutLMV3 for Token Classification

artpods56 · June 19, 2025, 1:41pm

The OCR engine you are using for inference should be the same you used for training. By default the Transformers AutoProcessor uses PyTesseract under the hood when you set apply_ocr attribute to True. So the solution is to use the same ocr engine during inference and to manualy pass words and bboxes to the processor.
Please correct me if I’m wrong but I feel like this is the case.

Topic		Replies	Views
Image Token classification LayoutLMv3 Beginners	0	360	November 7, 2023
How to Decode InputIDs back to String in LayoutLMV3 🤗Transformers	2	1371	March 8, 2024
LayoutLMv3 inference - bboxes are incorrect 🤗Transformers	0	118	May 10, 2024
LayoutLMv3 Inference Intermediate	2	1164	March 11, 2024
LayoutLMV3 inference without label 🤗Transformers	0	103	May 28, 2024

LayoutLMV3 for Token Classification

Related topics