Enhance a MarianMT pretrained model from HuggingFace with more training data

stelmath · September 7, 2020, 9:20am

I am using a pretrained MarianMT machine translation model from English to German. I also have a large set of high quality English-to-German sentence pairs that I would like to use to enhance the performance of the model, which is trained on the OPUS corpus, but without making the model forget the OPUS training data. Is there a way to do that? Thanks.

Also on StackOverflow

valhalla · September 7, 2020, 12:11pm

You could further fine-tune it on your own corpus, and I think if you have a high quality dataset then it should improve the results after fine-tuning.

You can use the finetune.py script from here for fine-tuning marian

rgwatwormhill · November 18, 2020, 2:52pm

If forgetting does turn out to be a problem, you could do your fine-tuning with a mixture of your new data and the OPUS data.

ayameRushia · May 29, 2021, 8:52am

Sorry can you re-share the link, the link doesn’t work

BramVanroy · May 29, 2021, 8:57am

Here it is.

Topic		Replies	Views
Adding New Tokens to MarianMT Model 🤗Tokenizers	8	760	February 4, 2024
Train MarianMT from scratch using transformers 🤗Transformers	0	320	December 28, 2022
How to train Marian Machine Translation Models	1	1036	June 23, 2022
Use custom Marian-NMT model in transformers 🤗Transformers	0	248	January 9, 2023
Fine-tuning of multilingual (translation) models Models	1	1508	August 17, 2023

Enhance a MarianMT pretrained model from HuggingFace with more training data

Related topics