How well does a language model perform when fine-tuned on a dialect of its trained language?

yassineafr · May 23, 2024, 7:17pm

I am currently working on fine-tuning an Arabic language model to adapt it to the Moroccan dialect using the LoRA (Low-Rank Adaptation) technique with a high rank. This is based on intuition, and I’m uncertain about its effectiveness due to the lack of high-quality data; my dataset consists mainly of YouTube comments and replies. I’m seeking advice on whether this approach is worthwhile or if I should consider an alternative strategy.

Topic		Replies	Views
Fine-tuning BERT for Machine Translation Models	0	725	May 21, 2022
Poor results (val_loss) on fine-tuning the NLLB-200-600M with LoRA for French-Wolof translation 🤗Transformers	3	312	October 1, 2024
Has Anyone Successfully Fine-Tuned Whisper for a Local Language for better accuracy Beginners	5	210	May 27, 2025
Need Help Understanding Fine-Tuning Techniques for My Thesis Beginners	3	78	January 6, 2025
Fine-tune a translation model on monolingual data Intermediate	1	434	June 16, 2022

How well does a language model perform when fine-tuned on a dialect of its trained language?

Related topics