Can someone point me to docs for how to train my own a model?

DataJuggler · January 3, 2023, 7:20pm

I would like to build an OCR library and / or a Font detection system. I have a large dataset of images with text I would like to use to train a model.

How do you go about this? I am a C# programmer but also know Python.

Thank you

rimusa · January 3, 2023, 10:30pm

Hi!

A good suggestion would be to go through the general HuggingFace tutorial, as this will help you understand the models and how they work:

Once you have the general idea of how the different HuggingFace packages work, you can check which OCR models there are available. Some examples are -

The TrOCR model in the transformers module:

Searching the HuggingFace repos for OCR models:

You can also check in towardsdatascience or in medium to see if there are ocr tutorials using HuggingFace.

DataJuggler · January 3, 2023, 11:04pm

Thank you. When I get a chance I will read up on this. I wasn’t sure where to start.

Topic		Replies	Views
Muti-Task Model - OCR + Object Detection Research	0	959	June 8, 2023
Help with Training a Custom Model using Hugging Face Transformers Beginners	0	30	October 11, 2024
How to extract tables from images using Hugging Face models? 🤗Transformers	1	354	September 17, 2024
Train with Text Beginners	0	199	October 20, 2023
How to train hugging face model? 🤗Transformers	0	335	April 14, 2023

Can someone point me to docs for how to train my own a model?

Related topics