Question answeirng Fine tuning

nielsr · May 31, 2024, 1:03pm

There are multiple ways to solve this, as you’re working with invoices I’d assume a vision-language model to perform better than a text-only one.

See our blog post on document AI for an overview: Accelerating Document AI. Models like LayoutLM are better than text-only models like DistilBERT.

Nowadays there are also a lot of generative document AI models including PaliGemma, Idefics2, LLaVa,… besides Donut, Pix2Struct, UDOP.

Another option is to fine-tune a text-only LLM on OCR-ed text as I explained here: Fine tune LLMs on PDF Documents - #9 by nielsr

Topic		Replies	Views
How to get a model on patent data for question answering Intermediate	1	851	October 15, 2021
BERT fine-tuning Models	0	508	January 29, 2024
Help in Finetuning a DistilBert uncased Q/A model Models	0	274	June 2, 2021
Adding small data in fine tune model - bert Models	0	343	October 20, 2022
Inference from a fine-tuned model -- help with interpretation of results Beginners	3	369	January 26, 2024