Memory requirements for Zeroscope backprop

Hi, what are the memory requirements for backpropagation through zeroscope model (TextToVideoSDPipeline)? I am getting out of memory problems when backprop with a batch size >=2 (using 80GB A100 card)