Strange error when using the Longformer (HuggingFace developers, please reply)

in my case I am struggling with cuda out of memory with longformer, I am using Google Colab pro