[discuss] approaches for reading order detection

what are the different models and dataset using ML techniques to find the reading order of layout sections in pdfs? any open license model will be a plus

To find the reading order of layout sections in PDFs using ML techniques, you can explore models and datasets like:

  1. LayoutLM: Uses transformer models to understand the layout and reading order in documents.
  2. DocFormer: Focuses on document understanding and layout analysis.
  3. OCR and Layout Analysis Datasets: Open datasets like PubLayNet and DocVQA can be used for training models. locksmith services

These models and datasets can help identify and process reading orders in PDFs.