Accuracy drops using Gradient checkpointing

Hi,
I am not able to replicate same performance as without using gradient checkpointing.
With Gradient ckp eval metrics is half without gradient ckp. I use lora adpaters . Am i missing any thing ?