Why are locally downloaded model files different from those on Hugging Face?

I downloaded a model to my local PC and saved it using the following code.


from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "beomi/Llama-3-Open-Ko-8B"

model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)



Downloading shards: 100%|██████████| 6/6 [02:50<00:00, 28.42s/it]
Loading checkpoint shards: 100%|██████████| 6/6 [00:06<00:00,  1.01s/it]
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.

The model files uploaded to Hugging Face (beomi/Llama-3-Open-Ko-8B at main) consist of 6 shards, and each model.safetensors file is under 3 GB,

while the files on my local PC match neither the file sizes nor the number of files.

Can anyone explain this situation, or suggest a way to solve the problem?

Thank you in advance


One can specify a max_shard_size when using the save_pretrained or push_to_hub methods, which defaults to 5 GB: Models
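As a minimal sketch of what that looks like: the snippet below re-saves a downloaded model with smaller shards so the local files resemble the ~3 GB shards on the Hub. The output directory name and the 3 GB shard size are illustrative choices, not values from this thread, and the shard-count helper is just back-of-the-envelope arithmetic.

```python
import math


def download_and_reshard(repo_id: str, out_dir: str, shard_size: str = "3GB"):
    """Download a model and re-save it locally with smaller shard files."""
    # Deferred import so the pure helper below works without transformers.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model = AutoModelForCausalLM.from_pretrained(repo_id)
    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    # max_shard_size caps how large each .safetensors file may grow.
    model.save_pretrained(out_dir, max_shard_size=shard_size)
    tokenizer.save_pretrained(out_dir)


def estimated_shards(n_params: int, bytes_per_param: int, shard_gb: float) -> int:
    """Rough number of shard files for a given precision and shard size."""
    total_gb = n_params * bytes_per_param / 1e9
    return math.ceil(total_gb / shard_gb)
```

For example, an 8B-parameter model saved in float32 (4 bytes per parameter) with 3 GB shards would come out to roughly `estimated_shards(8_000_000_000, 4, 3)` = 11 files.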

Thank you.

But the thing I am curious about is that the model files on Hugging Face sum to less than 17 GB, as below,

while the model files I downloaded locally sum to more than 30 GB.

How can the size of the model change?

By default, a precision of float32 (32 bits or 4 bytes per parameter) is used. Hence, as beomi/Llama-3-Open-Ko-8B · Hugging Face has 8 billion parameters, that’s 8*4 = 32 GB.

If you load in half-precision (bfloat16 or 2 bytes per parameter) then you’ll get 8*2 = 16 GB.
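To make that arithmetic concrete, here is the same calculation in plain Python (no downloads involved):

```python
# Back-of-the-envelope size check for an 8B-parameter model.
n_params = 8_000_000_000  # beomi/Llama-3-Open-Ko-8B has ~8B parameters

fp32_gb = n_params * 4 / 1e9  # float32: 4 bytes per parameter
bf16_gb = n_params * 2 / 1e9  # bfloat16: 2 bytes per parameter

print(fp32_gb)  # 32.0 -> roughly the >30 GB seen locally
print(bf16_gb)  # 16.0 -> roughly the <17 GB stored on the Hub
```

So the Hub checkpoint is stored in half precision, while loading it with the default settings materializes it in float32, doubling the on-disk size when re-saved.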


100% understood, much appreciated!!
