Is there a open source implementation of "Deep Learning Based Page Layout Analyze"?

Is there a open source implementation of “Deep Learning Based Page Layout Analyze”?

Repo: GitHub - leonlulu/DeepLayout: Deep learning based page layout analysis
DeepLayout: A Semantic Segmentation Approach to Page Layout Analysis

Is there a model in Huggingface that could achieve the same?

@inproceedings{li2018deeplayout,
title={DeepLayout: A Semantic Segmentation Approach to Page Layout Analysis},
author={Li, Yixin and Zou, Yajun and Ma, Jinwen},
booktitle={International Conference on Intelligent Computing},
pages={266--277},
year={2018},
organization={Springer}
}
1 Like

I’m not sure about that paper, but this library is very useful, and you can plug and play with different object detection models

1 Like

@eugeneware Layout Parser library is super interesting. But looks like it is more for understanding PDF documents.

Are you aware of any specific layout parsers to understand Web layout. Lets say it can understand web elements in a web page.

Web elements could be:

  • A Table
  • A Drop Down Menu
  • A Numbered List
  • A Bulleted List
  • A Radio Button

I’m not personally aware - but if you can find a dataset of annotated web layouts, you could use that library to fine tune your own library. I imagine that paper you referenced would have some reference to datasets that they benchmarked on.

Yeah, that makes sense. I was looking into LayoutLM

But I do not really understand this model could be applied to my use case.