How to correctly use model weights outside of forward in distributed training set-up with Accelerate?

marghovo · November 12, 2024, 5:17pm

I am using Accelerate to set up a distributed training script where I need to use some of the weights of my model outside of the forward method. What is the best practice to do this in order to avoid making a parameter ready twice?

Topic		Replies	Views
Loading weights straight to GPU & Training support 🤗Accelerate	0	215	September 18, 2023
How to only load model weights for the evalaution script? 🤗Accelerate	1	453	March 13, 2023
Decreasing performance when using Accelerate 🤗Accelerate	1	2311	March 8, 2022
Does accelerate.prepare() destroy model weights even if --model_name_or_path is specified and model is loaded? 🤗Accelerate	1	727	June 23, 2023
Multi-GPU Distributed Training using Accelerate on Windows 🤗Accelerate	0	1552	August 9, 2023

How to correctly use model weights outside of forward in distributed training set-up with Accelerate?

Related topics