My system runs out of GPU memory. I just want to test whether the model works, so I'd like to load it on the CPU instead. How do I do that?
Adding `device = torch.device('cpu')` before loading the model doesn't help.
It crashes here:
```python
model = AutoModelForCausalLM.from_pretrained(
    config.base_model_name_or_path,
    quantization_config=bnb_config,
    device_map="auto",
    trust_remote_code=True,
)
```