Does pipline with accelerate use "with init_empty_weights():"?

vivek9840 · April 7, 2024, 11:37am

i am using pipeline to infer LLMs with 7B to 70B on kaggle and my local setup.

i saw the application of accelerate for infering large models. and i saw the base.py and utils of pipeline from huggigface.

i didn’t find the “with init_empty_weights():” which is used for infering big models.

does it not use it or it is used but nt written in those files.

if it not written, can i load model with “with init_empty_weights():” and pass that model in pipeline with same device as i load using load_checkpoint_and_dispatch() and tie_weight()?

marcsun13 · April 15, 2024, 12:18pm

Hi @vivek9840, you can’t do the following:

if it not written, can i load model with “with init_empty_weights():” and pass that model in pipeline with same device as i load using load_checkpoint_and_dispatch() and tie_weight()?

However, you can pass device_map=“auto” in pipeline. It will use init_empty_weights and more to load your model on your gpu(s). With this arg, you will be able to perform big model inference as described Handling big models for inference.

vivek9840 · April 15, 2024, 3:52pm

thanks for your reply. i was also curious since i didn’t see the line where it uses init_empty_weights. i tried to see the dile in pipeline and accelerate but i didn’t saw the use of init_empty_weights.

system · April 16, 2024, 3:53am

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Using loaded model with accelerate for inference 🤗Accelerate	3	1995	November 4, 2022
How to only load model weights for the evalaution script? 🤗Accelerate	1	449	March 13, 2023
Initialize model with empty weight causes OOM with offloading to disk Beginners	0	28	February 1, 2025
Error: The model weights are not tied. Please use the `tie_weights` method before using the `infer_auto_device` function, even after adding model.tie_weights() Beginners	8	10876	February 20, 2024
Initializing a big model on GPU with random weights 🤗Transformers	2	69	January 14, 2025

Does pipline with accelerate use "with init_empty_weights():"?

Related topics