Difference between GAT and Transformer?

What is the difference between GAT and Transformer?

I read GAT gives local attention, whereas Transformers give global attention.

Can anyone provide any intuition when GAT would outperform Transformers and vice versa. Also for what task GAT would be more suitable than Transformers.

1 Like