I have fine-tuned the Helsinki-NLP OPUS-MT model for English-to-Malayalam translation. The pretrained model is around 230 MB, but after I fine-tuned it on a small dataset, the model's size rose to about 500 MB. Does anyone know why the size roughly doubles, and whether there are any optimization techniques to mitigate this? Thank you in advance.
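My current suspicion is that the `Trainer` saved the fine-tuned weights in float32 while the original Helsinki-NLP checkpoint stores them in float16, which would account for the roughly 2x size on disk. Below is a minimal sketch of how I'd cast the weights back to half precision and re-save them; the checkpoint paths are placeholders, and this assumes the size difference really does come from precision rather than extra files (e.g. optimizer state) in the output directory:

```python
# Minimal sketch: re-save a fine-tuned MarianMT checkpoint in float16.
# Assumes the size doubling is due to weights being stored in float32;
# the paths below are hypothetical placeholders.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

model = AutoModelForSeq2SeqLM.from_pretrained("path/to/finetuned-checkpoint")
tokenizer = AutoTokenizer.from_pretrained("path/to/finetuned-checkpoint")

# Cast the weights to half precision and save again. float16 uses
# 2 bytes per parameter instead of 4, so the checkpoint on disk
# should shrink by roughly half.
model.half()
model.save_pretrained("path/to/finetuned-fp16")
tokenizer.save_pretrained("path/to/finetuned-fp16")
```

If anyone knows whether casting back to float16 degrades translation quality for these OPUS-MT models, I'd appreciate that as well.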