Original transformers model implementation

Yes @Vasily
I used the vanilla transformer implemented in PyTorch.

https://pytorch.org/docs/stable/generated/torch.nn.Transformer.html