Hi! I was taking a look at Hugging Face’s DeBERTa implementation, but I couldn’t find where in the code are the absolute positions being added (they should come before the softmax layer), can someone point me towards the line where that is being done?
Thank you for your time
1 Like
cc @lysandre maybe?