Seeking Advice on Implementing HTML Inspection Service

petrovvl · April 15, 2024, 2:26am

I want to implement a service that checks for broken HTML files and pinpoints the exact locations of errors, such as missing tags, excessive tags, unexpected special characters, etc.

I have a large dataset containing both valid and invalid HTML files. So far, I’ve chosen an LSTM model, which effectively reconstructs missing tags. Then, I compare the reconstructed text with the original and show the diff.

However, I’m unsure if this model will fulfill all my requirements or if there might be a better option available for my needs. I would appreciate any advice.

Topic		Replies	Views
Any web parser models? Beginners	0	175	April 19, 2024
Sentence-transformers/all-mpnet-base-v2 requires Input Text after Cleaning or Raw Text Only Models	0	592	January 6, 2022
Tips for Debugging Model Cards 🤗Transformers	11	681	September 18, 2020
Cost-Effective LLM for Extracting Web Selectors from E-Commerce HTML Models	0	90	February 17, 2025
Seeking Advice on Processing Support Conversations for Efficient RAG Model Search Intermediate	0	50	September 9, 2024

Seeking Advice on Implementing HTML Inspection Service

Related topics