Big Model Inference: CPU/Disk Offloading for Transformers Using from_pretrained

Yes. Just set device_map="auto" in your call and it’ll do this automatically

2 Likes