M2M-100 fine-tuning

I’ve been experimenting with fine-tuning m2m100, hoping that fine-tuning on one language pair (e.g. en-fr) would improve that pair without affecting the rest of the model. I followed this Hugging Face guide for m2m100, which says to set the tokenizer language for multilingual models.
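
For reference, this is the kind of setup I mean by “set the tokenizer lang” (a minimal sketch, not my exact preprocessing code — the column names and max_length are placeholders):

```python
from transformers import M2M100ForConditionalGeneration, M2M100Tokenizer

model = M2M100ForConditionalGeneration.from_pretrained("facebook/m2m100_418M")
tokenizer = M2M100Tokenizer.from_pretrained("facebook/m2m100_418M")

# Per the guide: tell the tokenizer which languages the pair uses so it
# prepends the correct language codes to source and target sequences.
tokenizer.src_lang = "en"
tokenizer.tgt_lang = "fr"

def preprocess(batch):
    # "en" / "fr" are placeholder column names, not my real dataset schema
    model_inputs = tokenizer(batch["en"], max_length=128, truncation=True)
    with tokenizer.as_target_tokenizer():
        labels = tokenizer(batch["fr"], max_length=128, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs
```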

However, this did not work out: after training, other language pairs were affected. (To check, I also tested an already fine-tuned en-fr model located here: NDugar/m2m100_418M-fr · Hugging Face.)

That model suffers the same issue as mine: translating another language pair, such as Spanish to Russian, brings French grammar and words into the resulting translation, so the languages seem to have blended together.
[IMAGE1]
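
Concretely, this is roughly how I’m testing the unrelated pair on that checkpoint (standard M2M-100 inference as in the model card; the Spanish sentence is just a placeholder, and I’m assuming the tokenizer files ship with that repo):

```python
from transformers import M2M100ForConditionalGeneration, M2M100Tokenizer

# the already fine-tuned en-fr checkpoint linked above
model = M2M100ForConditionalGeneration.from_pretrained("NDugar/m2m100_418M-fr")
tokenizer = M2M100Tokenizer.from_pretrained("NDugar/m2m100_418M-fr")

# Spanish source sentence (an example, not the one in the screenshot)
tokenizer.src_lang = "es"
encoded = tokenizer("La vida es como una caja de bombones.", return_tensors="pt")

# ask for Russian output by forcing the target-language token
generated = model.generate(
    **encoded,
    forced_bos_token_id=tokenizer.get_lang_id("ru"),
)
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
# on the fine-tuned checkpoint, outputs like this come back with French mixed in
```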

I am a complete newcomer to NLP and AI in general, so apologies if this is a dumb post. I have noticed, however, that in my compute_metrics function, when it is called via .evaluate() before I even train on new data, the predictions for the data passed to the trainer are not in the target language but in a mix of seemingly random languages:
[IMAGE2 BELOW POST]

[IMAGE3 BELOW POST]
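
For context, the evaluation is wired up roughly like this (a minimal sketch, not my exact notebook: the toy dataset, output_dir, and returned metric are placeholders, and the forced_bos_token_id line is flagged in a comment as something I did not originally have):

```python
import numpy as np
from datasets import Dataset
from transformers import (
    DataCollatorForSeq2Seq,
    M2M100ForConditionalGeneration,
    M2M100Tokenizer,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

model = M2M100ForConditionalGeneration.from_pretrained("facebook/m2m100_418M")
tokenizer = M2M100Tokenizer.from_pretrained("facebook/m2m100_418M")
tokenizer.src_lang = "en"
tokenizer.tgt_lang = "fr"

# NOTE: not from my original script -- as I understand it, without this (or
# passing forced_bos_token_id to generate()) nothing forces the decoder to
# start in French when evaluate() runs generation.
model.config.forced_bos_token_id = tokenizer.get_lang_id("fr")

# toy eval split standing in for my real data
raw_eval = Dataset.from_dict(
    {"en": ["How are you today?"], "fr": ["Comment allez-vous aujourd'hui ?"]}
)

def preprocess(batch):
    model_inputs = tokenizer(batch["en"], max_length=128, truncation=True)
    with tokenizer.as_target_tokenizer():
        labels = tokenizer(batch["fr"], max_length=128, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized_eval = raw_eval.map(preprocess, batched=True, remove_columns=["en", "fr"])

def compute_metrics(eval_preds):
    preds, labels = eval_preds
    if isinstance(preds, tuple):
        preds = preds[0]
    decoded_preds = tokenizer.batch_decode(preds, skip_special_tokens=True)
    # this is where I print/inspect the predictions shown in the screenshots;
    # the returned value is a stand-in, not my real metric
    return {"num_eval_examples": len(decoded_preds)}

trainer = Seq2SeqTrainer(
    model=model,
    args=Seq2SeqTrainingArguments(
        output_dir="m2m100-en-fr",       # placeholder
        predict_with_generate=True,      # so evaluate() calls generate()
        per_device_eval_batch_size=8,
    ),
    eval_dataset=tokenized_eval,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    tokenizer=tokenizer,
    compute_metrics=compute_metrics,
)

trainer.evaluate()  # predictions reach compute_metrics before any training step
```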

Essentially, what I’m asking is: is it possible to train m2m-100 on only one language pair while preserving the weights for all other language pairs / languages not involved? It seems that even training on the one pair causes the model to produce the wrong languages, or something else is affecting it.


[IMAGE3] - https://media.discordapp.net/attachments/954340654069202966/957116541617446933/unknown.png?width=965&height=488

[IMAGE2] - https://media.discordapp.net/attachments/954340654069202966/957116757041094686/unknown.png?width=895&height=488
The predictions var in question

I have the same problem with this code: Google Colab.
We must have done the same thing…

Hi, any progress with this?

+1 facing the same issue here