Gradient checkpointing without training

Mike5645 · July 18, 2023, 8:51pm

I have a LlamaForCausalLM model. I want to do a single run of backprop on a single sample (one forward pass, one backward pass) and record all the gradients that are computed in the process. I do not want to actually update the model weights- I just want to record the gradients. The model is pretty big and I only have a single GPU, so to be able to do this I need to use gradient checkpointing. Is there a way to use a Trainer to accomplish this? Thanks!

Topic		Replies	Views
Gradient_checkpointing control 🤗Transformers	0	1161	August 10, 2023
Using gradient_checkpointing=True in Trainer causes error with LLaMA 🤗Transformers	1	2565	July 8, 2023
Can we use Gradient Checkpointing and Gradient Accumulation at Once? 🤗Transformers	1	1234	September 14, 2021
Is there a way to backpropagate through multiple steps while using Trainer API 🤗Transformers	1	255	July 9, 2021
Accuracy drops using Gradient checkpointing 🤗Transformers	0	160	September 7, 2023

Gradient checkpointing without training

Related topics