Size of saved model: Is there a way to make it smaller for deploy?

slowturtle · July 27, 2022, 3:13pm

Hello friends,

When I finish my trainning i am using:

trainer.save_model("mode_name")

The line above saves 6 files, including a pytorch_model.bin that is in average 400mb in size. This is a problem to due to github size limitation to 100mb(I know i can use git LFS).

After saving my model I am loading it to use with these lines:

from transformers import pipeline, BertTokenizer

tokenizer = BertTokenizer.from_pretrained(path)
classifier = pipeline('text-classification',model=path, tokenizer=tokenizer, top_k=5)

So my trainning arguments are saving all epochs. I already know how to avoid this.

My question is: is there a way to load the model, remove all epochs but the best one ans save it again for deploy with a smaller .bin file? I dont know if am packing a lot of stuff that I dont need to use the model, that’s my concern.

Thanks a lot,

pravinandhale · July 27, 2023, 7:25am

@slowturtle
Hi… I need your help related to NLP …can i have ur email id ?
This is mine … pravin.andhale03@gmail.com

Topic		Replies	Views
Storage-efficient ways to store models 🤗Transformers	0	298	July 8, 2023
Saving standard BertModel english and BertModel multilingual have drastically different sizes? 🤗Transformers	2	275	August 28, 2020
Model saving results in a small size checkpoint 🤗Transformers	1	624	January 4, 2021
Any model's size is huge when saved as opposed to downloading from hub pretrained 🤗Transformers	3	363	February 17, 2024
How to save my model to use it later Beginners	15	175773	November 10, 2024

Size of saved model: Is there a way to make it smaller for deploy?

Related topics