Hi, we are working on a multi-agent tool to automate the recognition of tax receipts. We upload a PDF of e.g. 20 pages and then need to split the pages into individual documents. We assume that all pages of a receipt are grouped together, but their order might be shuffled. (e.g. receipt 1 page 1, re…

Cost of Tax receipt recognition OCR vs. LLM

John6666 March 21, 2025, 6:26pm 2

If you’re talking about extracting information from PDFs, current open-source models have reached a good level. I think VLMs, either on their own or combined with an LLM, as discussed in the topics listed below, are quite practical. There are also several VLMs that specialize in PDF OCR, and you can find various clues by searching past posts on the forum.
In addition, if you need more specialized advice, I recommend asking on the HF Discord.
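
For the page-splitting part of the question, here is a minimal sketch of what could happen after the VLM/LLM step. It assumes some model has already returned a receipt identifier and a printed page number for every page; the `PageLabel` class, its fields, and `split_into_receipts` are hypothetical names for illustration, not part of any library. It relies on the assumption stated in the question that pages of one receipt are contiguous but possibly shuffled within their run.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass(frozen=True)
class PageLabel:
    """Hypothetical per-page output of a VLM/LLM extraction step."""
    pdf_index: int   # 0-based position of the page in the uploaded PDF
    receipt_id: str  # identifier the model read from the page (assumed)
    page_no: int     # page number printed on the receipt (assumed)


def split_into_receipts(labels: list[PageLabel]) -> list[list[int]]:
    """Group consecutive pages with the same receipt_id, then restore page order.

    Assumes pages of one receipt are contiguous in the PDF but may be
    shuffled within that contiguous run.
    """
    ordered = sorted(labels, key=lambda p: p.pdf_index)
    receipts = []
    # groupby only merges adjacent items, which matches the contiguity assumption.
    for _, run in groupby(ordered, key=lambda p: p.receipt_id):
        pages = sorted(run, key=lambda p: p.page_no)
        receipts.append([p.pdf_index for p in pages])
    return receipts


# Example: receipt A's pages arrive as 2, 1; receipt B's as 1, 3, 2.
example = [
    PageLabel(0, "A", 2),
    PageLabel(1, "A", 1),
    PageLabel(2, "B", 1),
    PageLabel(3, "B", 3),
    PageLabel(4, "B", 2),
]
print(split_into_receipts(example))  # [[1, 0], [2, 4, 3]]
```

Each inner list holds the PDF page indices of one receipt in reading order. In practice the hard (and costly) part is producing reliable labels per page; this only shows the grouping once those labels exist.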

Topic	Category	Replies	Views	Activity
Extracting information from bills, tax statements, etc: What ML model to use?	Research	3	3198	August 28, 2024
Best route for text extraction from Invoice documents	Beginners	3	914	July 3, 2025
Image to text models tailored for web scraping?	Models	1	812	June 9, 2024
Training a model for a PDF with OCR - where to begin?	Beginners	4	10612	October 27, 2024
Model for evaluation of scanned survey data	Beginners	9	35	January 19, 2025

