AutoModelForCausalLM and transformers.pipeline

I have already downloaded the model and am using the local path directly.

import torch
import transformers

model_id = "./Llama3"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

It shows this error:

ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
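
For reference, I believe this is roughly the equivalent direct load with AutoModelForCausalLM (a sketch, untested on my side, same local path as above):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "./Llama3"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",  # same setting that triggers the disk-offload error above
)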

Q1.
What exactly does "You are trying to offload the whole model to the disk" mean?
If I use model_id = "meta-llama/Meta-Llama-3-8B" instead, it automatically downloads the folder models--meta-llama--Meta-Llama-3-8B into ./cache.

So is the downloaded model also a case of offloading the whole model to the disk?
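
For context on Q1, the download step can also be done explicitly; as far as I understand, it only writes the weight files into the cache and says nothing about where they are loaded at runtime (a sketch, assuming huggingface_hub is installed and the repo is accessible):

from huggingface_hub import snapshot_download

# Downloading only places the files in the local cache, e.g.
# ./cache/models--meta-llama--Meta-Llama-3-8B
local_path = snapshot_download("meta-llama/Meta-Llama-3-8B", cache_dir="./cache")
print(local_path)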

Q2.
Since the error mentions offloading the whole model to disk: what is the code to load the model entirely into memory instead (the opposite of disk offload)?
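
For Q2, my guess is something like pinning everything to one device instead of using device_map="auto" (untested):

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device=0,  # my guess: load the whole model onto GPU 0, with no offloading
)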