TrOCR training from scratch

Shiro · June 2, 2022, 10:07pm

Hi,

I was wondering if anyone succeed to train TrOCR from scratch with the huggingface library ?
I have some weird behaviors where the model is not really learning when I create data from this repository : GitHub - clovaai/synthtiger: Official implementation of SynthTIGER (Synthetic Text Image GEneratoR) ICDAR 2021

I suppose it may be related to hyper parameters but until now I did not succeed to make the model better. (I did not succeed to get good results neither with a finetuning instead of pretraining from scratch)

so my question is : does anyone succeed to finetune on an artificial dataset the TrOCR model ? With which parameters? Because the finetuning with IAM dataset works well but as soon as I used artificial dataset it does not work. (whatever is the sequence length to predict)

HGamal · October 23, 2022, 1:00pm

Hi, have your found out what was the problem?

Topic		Replies	Views
Fine-tuning TrOCR to do digit recognition in another language Models	0	287	May 21, 2024
Fine tune trocr model Models	1	181	April 18, 2025
Can someone point me to docs for how to train my own a model? Models	2	621	January 3, 2023
Help with Training a Custom Model using Hugging Face Transformers Beginners	0	30	October 11, 2024
Tutorial: Fine-tuning with custom datasets – sentiment, NER, and question answering 🤗Transformers	19	12845	February 12, 2024

TrOCR training from scratch

Related topics