Is it possible to fine-tune a pretrained Hugging Face multilingual translation model (e.g., NLLB) on a new language pair, where one language (say, English) is already seen by the pretrained model and the other is not?
If so, is the procedure the same as usual, i.e., create a dataset and implement a training/fine-tuning script?
Here’s one possibly helpful resource: How to fine-tune a NLLB-200 model for translating a new language | by David Dale | Medium
There are some additional steps compared to the usual fine-tuning process, e.g. adding the new language's token to the tokenizer (see the sketch after this reply).
The article linked above uses a good amount of custom code, though; I'm still looking around for something more Hugging Face-centric.
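For the tokenizer step, a minimal sketch of the general idea is below, using standard transformers APIs. The language code xyz_Latn and the choice of fra_Latn as the embedding-initialization source are placeholder assumptions for illustration, not values taken from the article.

```python
# Rough sketch only, based on the general approach described in the linked article.
# "xyz_Latn" is a hypothetical code for the unseen language; "fra_Latn" is an
# arbitrary related language chosen here just as an example.
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model_name = "facebook/nllb-200-distilled-600M"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

new_lang = "xyz_Latn"  # placeholder code for the new language

# Register the new language code as an extra special token so the tokenizer
# can use it as a source/target language tag.
tokenizer.add_special_tokens({"additional_special_tokens": [new_lang]})

# Grow the embedding matrix to make room for the new token.
model.resize_token_embeddings(len(tokenizer))

# Optionally initialize the new code's embedding from a related language
# instead of leaving it randomly initialized (the NLLB checkpoints tie input
# and output embeddings, so updating the input embedding should be enough).
with torch.no_grad():
    emb = model.get_input_embeddings().weight
    new_id = tokenizer.convert_tokens_to_ids(new_lang)
    emb[new_id] = emb[tokenizer.convert_tokens_to_ids("fra_Latn")]
```

After that, fine-tuning itself proceeds as for any seen language pair: tokenize parallel sentences with the appropriate source/target language codes and train with your usual seq2seq training script.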