Hi All, Which model can read CAD PDF Drawing documents.
1 Like
Hi @RameshAnbalagan !
Reading CAD PDF drawing documents can be tricky because they often contain a mix of text, vector graphics, and images. Depending on your goal, here are some options:
1. Extract Text (OCR)
If you want to extract text from the PDF (like dimensions or annotations), you can use OCR models:
- Tesseract OCR: A free, open-source option for recognizing text in PDFs.
- Azure Form Recognizer or Google Cloud Vision: More advanced cloud-based options for structured text extraction.
2. Understand Vector Graphics
To process the vector graphics or drawings themselves, try:
- DeepCAD: A deep learning model designed for CAD drawings.
- SVG or DXF Parsing Libraries: Convert PDF to DXF or SVG format and analyze them using CAD-specific libraries like
ezdxf
orsvgpathtools
.
3. Hybrid Models for PDF Analysis
For a combination of text and image understanding:
- LayoutLMv2 or LayoutLMv3 (Hugging Face): Specialized for document layouts, including text and images.
Hope this help!
1 Like