Downloading and storing models

Following this blog post

I downloaded the BLOOM model (the bigscience/bloom checkpoint) using

model = AutoModelForCausalLM.from_pretrained("bigscience/bloom", device_map="balanced_low_0", torch_dtype=torch.float16, cache_dir=<path to direc>)

Afterwards I can see a 350 GB file created in <path to direc> (it took quite some time to download, but that is okay).

However, upon restarting the session, I observe two behaviors:

  1. model = AutoModelForCausalLM.from_pretrained("bigscience/bloom", device_map="balanced_low_0", torch_dtype=torch.float16, cache_dir=<path to direc>) quickly loads the model (i.e., it does not re-download the 350 GB file), since the model is found in <path to direc>. This is the desired behavior.
  2. However, generator = pipeline("text-generation", model="bigscience/bloom", cache_dir=<path to direc>, device_map="balanced_low_0", torch_dtype=torch.float16) begins re-downloading the model, and I am not sure why.

Moreover, the files in <path to direc> include blobs, refs, and snapshots folders, but no .json file, which is probably what pipeline() is searching for.
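For context, that blobs/refs/snapshots layout is the Hub cache format: each model lives under a models--&lt;org&gt;--&lt;name&gt; folder, with blobs/ holding the content-addressed files and snapshots/&lt;commit&gt;/ holding the original filenames (including config.json) pointing at those blobs. A stand-alone sketch that mimics this layout and resolves a snapshot directory (the find_snapshot helper is hypothetical, for illustration only — it is not a transformers API):

```python
import os
import tempfile

def find_snapshot(cache_dir, repo_id):
    """Return the most recent snapshot directory for repo_id inside a
    Hub-style cache, or None if the model was never downloaded."""
    repo_dir = os.path.join(cache_dir, "models--" + repo_id.replace("/", "--"))
    snap_root = os.path.join(repo_dir, "snapshots")
    if not os.path.isdir(snap_root):
        return None
    snapshots = [os.path.join(snap_root, d) for d in os.listdir(snap_root)]
    return max(snapshots, key=os.path.getmtime) if snapshots else None

# Build a fake cache with the same shape as the real one.
cache = tempfile.mkdtemp()
snap = os.path.join(cache, "models--bigscience--bloom", "snapshots", "abc123")
os.makedirs(snap)
open(os.path.join(snap, "config.json"), "w").close()  # config.json lives in the snapshot

path = find_snapshot(cache, "bigscience/bloom")
print(path)  # ends with .../models--bigscience--bloom/snapshots/abc123
```

So the config.json is there — it just sits inside a commit-named snapshot folder rather than at the top of <path to direc>.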

My questions are:

  1. How can I make pipeline() use the downloaded model instead of re-downloading it?
  2. Why isn't from_pretrained() downloading the .json file?

One way I read about is to use model.save_pretrained(<path>) and then pipeline(<path>) to load, but this takes an extra 350 GB of space, since save_pretrained(<path>) writes out a new copy of the model with a .json file in it.

Any suggestions?

Oooh, I think I figured it out.

I think the correct way to use pipeline() is to do something like the following:

model = AutoModelForCausalLM.from_pretrained("bigscience/bloom", device_map="balanced_low_0", torch_dtype=torch.float16, cache_dir=<path to direc>)

followed by

tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom", cache_dir="/home/racball/opt175b_tokeniser")

Then,

generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
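Put together, the pattern looks like the sketch below. To keep the example cheap to run it uses "sshleifer/tiny-gpt2" as a stand-in checkpoint; for the real thing, swap in "bigscience/bloom" plus the device_map/torch_dtype/cache_dir arguments from above.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

# Load the model and tokenizer once; both land in (or are read from) the cache.
model = AutoModelForCausalLM.from_pretrained("sshleifer/tiny-gpt2")
tokenizer = AutoTokenizer.from_pretrained("sshleifer/tiny-gpt2")

# Passing the already-loaded objects means pipeline() has nothing left to
# download: it reuses exactly what from_pretrained() produced.
generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
out = generator("Hello", max_new_tokens=5)
print(out[0]["generated_text"])
```

The key design point is that pipeline() only resolves and downloads a checkpoint when it is given a string identifier; given live model and tokenizer objects, it skips that step entirely.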
