Finetuning TrOCR on the IAM dataset

d0r1h · August 11, 2022, 6:45am

I’m fine-tuning a TrOCR model with the Seq2SeqTrainer API, using this notebook as a reference.

Training completed, but no model was saved. I’ve attached an image of the contents of the output directory.

Also, I’ve attached my training notebook for reference.

nielsr · August 11, 2022, 10:37am

Hi,

The error you’re getting is:

OSError: Can't load feature extractor for './model'. If you were trying to load it from 'https://huggingface.co/models', make sure you don't have a local directory with the same name. Otherwise, make sure './model' is the correct path to a directory containing a preprocessor_config.json file

That’s because the Seq2SeqTrainer only saves the model files (namely the weights as a pytorch_model.bin file and the configuration as a config.json file).

However, you’re also loading the processor (TrOCRProcessor) from that directory. A processor combines a feature extractor (for the vision modality) and a tokenizer (for the text modality), so it requires a preprocessor_config.json file as well as the tokenizer’s vocabulary files (vocab.json and merges.txt for this checkpoint). It seems that you’re just using the pretrained one:

processor = TrOCRProcessor.from_pretrained("microsoft/trocr-base-handwritten")

Since you’re using the pretrained processor unchanged, you can load it from the same checkpoint when performing inference. You can also save its files into your model directory using save_pretrained. You can see the files required here: microsoft/trocr-base-handwritten at main
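
As a rough sketch of the save_pretrained suggestion: the helper below checks whether a directory already contains preprocessor_config.json (the file named in the error) and, if it does not, writes the pretrained processor's files there. The names has_processor_files and ensure_processor are hypothetical, and "./model" is assumed to be the directory the trainer wrote to.

```python
from pathlib import Path

# Checkpoint the processor was originally loaded from in this thread.
BASE_CHECKPOINT = "microsoft/trocr-base-handwritten"


def has_processor_files(model_dir):
    """Return True if model_dir contains the preprocessor_config.json
    file that the OSError above complains about."""
    return (Path(model_dir) / "preprocessor_config.json").is_file()


def ensure_processor(model_dir, base_checkpoint=BASE_CHECKPOINT):
    """Save the pretrained processor into model_dir if it is missing.

    Returns True if files were written, False if they were already there.
    """
    if has_processor_files(model_dir):
        return False
    # Deferred import: only needed when files actually have to be written.
    # The first call downloads the checkpoint files unless they are cached.
    from transformers import TrOCRProcessor

    processor = TrOCRProcessor.from_pretrained(base_checkpoint)
    processor.save_pretrained(model_dir)  # writes feature extractor + tokenizer files
    return True
```

After calling ensure_processor("./model") once, TrOCRProcessor.from_pretrained("./model") and VisionEncoderDecoderModel.from_pretrained("./model") should both be able to load from the same directory, assuming the trainer saved the model weights and config.json there.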
