Fine-Tuning the GIT Model for Malayalam Image Captioning

ElsaJohn · June 6, 2023, 9:41am

Hello everyone,

I am currently working on a project to fine-tune the GIT model for image captioning in the Malayalam language. I am following the tutorial notebook Fine_tune_GIT_on_an_image_captioning_dataset.ipynb from the NielsRogge/Transformers-Tutorials repository on GitHub, which fine-tunes the model on English captions. However, I am unsure which changes I need to make for Malayalam.

Here are some of my questions:

1. The tutorial uses an English tokenizer. What should I use for Malayalam text? Is there a pre-trained tokenizer available that I can use, or do I need to train my own?
2. During model training, how can I ensure that the model’s output vocabulary matches the vocabulary of the Malayalam tokenizer?
3. Are there any Malayalam-specific considerations I should take into account when evaluating the model’s performance?
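
To make the tokenizer and evaluation questions more concrete, here is a minimal sketch of two checks that could be run before any training. It is not taken from the tutorial. The helper names (`normalize`, `tokenizer_stats`, `char_ngram_f1`) and the `toy_tokenize` stand-in are hypothetical, the `"[UNK]"` default is an assumption about the tokenizer's unknown token, and the character n-gram score is a simplified overlap measure in the spirit of chrF, not the official metric.

```python
import unicodedata
from collections import Counter
from typing import Callable, Dict, Iterable, List


def normalize(text: str) -> str:
    """NFC-normalize and collapse whitespace, so that canonically
    equivalent Malayalam strings compare equal."""
    return " ".join(unicodedata.normalize("NFC", text).split())


def tokenizer_stats(
    tokenize: Callable[[str], List[str]],
    captions: Iterable[str],
    unk_token: str = "[UNK]",  # assumption: adjust to the tokenizer in use
) -> Dict[str, float]:
    """Report how often a tokenizer emits the unknown token and how many
    tokens it produces per whitespace-separated word."""
    n_tokens = n_unk = n_words = 0
    for caption in captions:
        caption = normalize(caption)
        tokens = tokenize(caption)
        n_tokens += len(tokens)
        n_unk += sum(1 for tok in tokens if tok == unk_token)
        n_words += len(caption.split())
    return {
        "unk_rate": n_unk / max(n_tokens, 1),
        "tokens_per_word": n_tokens / max(n_words, 1),
    }


def char_ngram_f1(hypothesis: str, reference: str, n: int = 3) -> float:
    """Simplified character n-gram F1 between two captions.
    Returns 0.0 when either string is shorter than n characters."""
    hyp, ref = normalize(hypothesis), normalize(reference)
    hyp_ngrams = Counter(hyp[i:i + n] for i in range(len(hyp) - n + 1))
    ref_ngrams = Counter(ref[i:i + n] for i in range(len(ref) - n + 1))
    overlap = sum((hyp_ngrams & ref_ngrams).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp_ngrams.values())
    recall = overlap / sum(ref_ngrams.values())
    return 2 * precision * recall / (precision + recall)


def toy_tokenize(text: str) -> List[str]:
    """Hypothetical stand-in for an English-only tokenizer:
    every non-ASCII word becomes the unknown token."""
    return [word if word.isascii() else "[UNK]" for word in text.split()]


stats = tokenizer_stats(toy_tokenize, ["ഒരു പൂച്ച", "a cat"])
```

With a Hugging Face tokenizer, `tokenizer.tokenize` and `tokenizer.unk_token` could be passed in place of the toy stand-in. A high `unk_rate` or a very large `tokens_per_word` on a sample of Malayalam captions would suggest that the tokenizer is a poor fit. The character-level score is included because word-level overlap can be harsh on a morphologically rich language, though that is a general observation and not a result measured here.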

I would really appreciate any guidance or resources that can help me with this task. Thank you in advance for your time and help!
