with accelerator.accumulate(model):
    loss = loss.mean()
    # change above from here
    accelerator.backward(loss)
    optimizer.step()
    scheduler.step()
    optimizer.zero_grad()
    global_step += 1
    log_steps += 1
# change above from here
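For intuition, here is a minimal sketch of what gradient accumulation does, using a hypothetical toy scalar model in plain Python (no Accelerate or PyTorch): gradients from several micro-batches are accumulated, then a single optimizer step is applied.

```python
def grad(w, x, y):
    # d/dw of the squared error (w * x - y) ** 2
    return 2 * x * (w * x - y)

def train_step(w, micro_batches, lr=0.1):
    """Accumulate averaged gradients over all micro-batches, then
    apply ONE update -- the analogue of calling optimizer.step()
    and optimizer.zero_grad() once per accumulation window."""
    accum_steps = len(micro_batches)
    g = 0.0
    for x, y in micro_batches:
        # dividing by accum_steps mirrors averaging the loss
        g += grad(w, x, y) / accum_steps
    return w - lr * g  # single optimizer step

w = 0.0
w = train_step(w, [(1.0, 2.0), (1.0, 2.0)])
# each micro-batch gradient is -4.0, average is -4.0, so w becomes 0.4
```

This is only a sketch of the arithmetic; inside `with accelerator.accumulate(model):`, Accelerate handles the same bookkeeping for you by skipping the actual `optimizer.step()` until the configured number of steps has been reached.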
Is this correct? Thank you in advance.