Hi everyone,
I’m currently working on fine-tuning the TrOCR model (from Hugging Face) for a handwriting OCR task using two datasets: IAM Handwriting Database and Imgur5K. My goal is to achieve:
CER ≤ 7%
WER ≤ 15%
However, after 10 epochs of training with TrOCR Base, my best results are:
CER: 48%
WER: 68%
Clearly, I’m far from the target. Can anyone help me reach it?
Here’s what I’m using:
Model: TrOCR Base
Hmm… here is what HuggingChat suggests:
To improve the performance of your TrOCR model for handwriting OCR, consider the following structured approach:
Data Preprocessing and Augmentation:
- Clean Up and Normalize: Ensure images are noise-free and normalized in size and contrast. This helps the model focus on relevant features.
- Augmentation: Apply transformations like rotation, cropping, and noise addition to artificially expand your dataset, enhancing generalization.
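The augmentation bullet above can be sketched with Pillow alone. The transform ranges below are illustrative assumptions, kept deliberately mild because aggressive rotation or noise can make handwriting illegible; it assumes grayscale ("L" mode) line images.

```python
import random
from PIL import Image, ImageEnhance

def augment(img: Image.Image) -> Image.Image:
    """Apply light, handwriting-safe augmentations to one grayscale line image."""
    # Small rotations only: large angles destroy handwriting legibility.
    angle = random.uniform(-3, 3)
    img = img.rotate(angle, resample=Image.BILINEAR, fillcolor=255)
    # Mild contrast jitter.
    img = ImageEnhance.Contrast(img).enhance(random.uniform(0.8, 1.2))
    # Sparse salt-and-pepper noise on ~0.2% of pixels.
    px = img.load()
    w, h = img.size
    for _ in range(int(0.002 * w * h)):
        px[random.randrange(w), random.randrange(h)] = random.choice((0, 255))
    return img
```

Applying this on the fly in the dataset's `__getitem__` (rather than precomputing) gives the model a slightly different view of each sample every epoch.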
Training Strategy:
- Increase Epochs: Extend training beyond 10 epochs to allow the model to learn more effectively.
- Learning Rate and Optimizer: Experiment with different learning rates and optimizers. Consider implementing learning rate scheduling for efficiency.
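The scheduling advice above is often realized as linear warmup followed by cosine decay (the shape of `get_cosine_schedule_with_warmup` in transformers). A pure-Python sketch of that curve; the base LR and step counts are illustrative, not tuned values:

```python
import math

def lr_at_step(step, base_lr=5e-5, warmup_steps=500, total_steps=20000):
    """Linear warmup to base_lr, then cosine decay to zero."""
    if step < warmup_steps:
        # Warmup: scale linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay: cosine from base_lr down to 0 over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))
```

In practice you would let `Seq2SeqTrainer` or a PyTorch `LambdaLR` drive this; the function just makes the shape of the schedule explicit.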
Model Fine-tuning:
- Larger Models: Explore using TrOCR-large for better performance, despite higher computational costs.
- Alternative Models: Consider fine-tuning models known for strong handwriting performance.
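Switching to the large checkpoint is mostly a matter of the model id (the public `microsoft/trocr-large-handwritten` release on the Hub). While doing so, it is worth double-checking the explicit token-id wiring: `VisionEncoderDecoderModel` does not infer these on its own, and leaving them unset is a common cause of anomalously high CER. A sketch assuming the transformers library:

```python
from transformers import TrOCRProcessor, VisionEncoderDecoderModel

def load_trocr(model_id: str = "microsoft/trocr-large-handwritten"):
    """Load a TrOCR checkpoint and set the token ids that
    VisionEncoderDecoderModel does not configure automatically."""
    processor = TrOCRProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    # Without these, generation starts from the wrong token and the
    # loss is computed against padding -> very high CER/WER.
    model.config.decoder_start_token_id = processor.tokenizer.cls_token_id
    model.config.pad_token_id = processor.tokenizer.pad_token_id
    model.config.eos_token_id = processor.tokenizer.sep_token_id
    model.config.vocab_size = model.config.decoder.vocab_size
    return processor, model
```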
Tokenizer Optimization:
- Comprehensive Vocabulary: Ensure the tokenizer includes all necessary characters and introduce a special token for unknown characters.
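Before adding special tokens, it helps to audit which characters in your labels the tokenizer cannot represent. A quick pure-Python check; the transcripts and the tokenizer's character inventory are assumed inputs you extract from your own data and tokenizer:

```python
def missing_chars(transcripts, vocab_chars):
    """Return characters that appear in the training labels but not in
    the tokenizer's character inventory (these degrade to <unk> at best)."""
    seen = set()
    for text in transcripts:
        seen.update(text)
    return sorted(seen - set(vocab_chars))
```

Any characters this reports are candidates for `tokenizer.add_tokens(...)` (followed by `model.decoder.resize_token_embeddings(...)`) or for normalization in preprocessing.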
Data Balancing:
- Check Bias: Evaluate dataset balance and address any biases, possibly through balancing techniques or adjusted loss functions.
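One concrete balancing technique the bullet hints at: inverse-frequency sample weights, fed to a weighted sampler so each data source (e.g. IAM vs. Imgur5K) is drawn roughly equally often. A minimal sketch; the source labels are assumptions about how you tag your samples:

```python
from collections import Counter

def sample_weights(sources):
    """Inverse-frequency weight per sample, so an over-represented
    source does not dominate each training batch."""
    counts = Counter(sources)
    return [1.0 / counts[s] for s in sources]
```

The resulting list can be passed to PyTorch's `WeightedRandomSampler` as the per-sample weights for the training `DataLoader`.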
Evaluation Metrics:
- Expand Metrics: Track additional metrics like accuracy and analyze model failures to identify patterns in errors.
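For tracking CER and WER themselves, libraries like `jiwer` or the Hugging Face `evaluate` metrics are the usual route; as a self-contained reference, both reduce to edit distance over characters or words:

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences, via dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                  # deletion
                           cur[j - 1] + 1,               # insertion
                           prev[j - 1] + (ca != cb)))    # substitution
        prev = cur
    return prev[-1]

def cer(ref, hyp):
    """Character error rate: edits per reference character."""
    return edit_distance(ref, hyp) / max(1, len(ref))

def wer(ref, hyp):
    """Word error rate: edits per reference word."""
    r, h = ref.split(), hyp.split()
    return edit_distance(r, h) / max(1, len(r))
```

Computing these per sample (not just in aggregate) makes the failure analysis suggested above much easier: sort the validation set by CER and inspect the worst predictions.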
Cost Function and Regularization:
- Experiment with Loss Functions: Try different loss functions and add regularization like dropout to prevent overfitting.
Model Architecture Adjustments:
- Enhance Architecture: Consider modifying the transformer layers or adding attention mechanisms for improved performance.
Research and Comparison:
- Benchmarking: Review existing research and compare your approach with others who have used TrOCR on similar tasks to identify best practices.
By systematically addressing each of these areas, starting with preprocessing and training adjustments, you can identify and mitigate performance bottlenecks, progressively improving your model’s CER and WER.