Vanilla Transformer

Onlydrinkwater · May 19, 2022, 2:22am

Hi all,

Is the transformer model and tokenizer used in the paper ‘Attention is all you need’ available in HF?

I want to reproduce the result in the paper.
( ‘We use Transformer (Vaswani et al., 2017) as the basic model structure’)
They got 28.4 bleu score using the basic transformer model on en-de task.

Any help will be appreciated!

DevavratSinghBisht · June 6, 2023, 6:10am

Hi @Onlydrinkwater ,

Were you able to get the implementation of the model and tokenizer ?
If yes then can you please share it with me.
Also, if you were able to replicate the results, can you please share any tips and tricks to do the same.

Topic		Replies	Views
Reproduce attention is all you need Beginners	0	480	June 25, 2022
How to train a translation model from scratch to reproduce <attention is all you need>? Beginners	0	400	November 29, 2022
How to make pure transformer model Beginners	0	136	May 22, 2024
Original transformers model implementation Beginners	2	976	June 1, 2022
Has vanilla transformer implemented in transformers library? 🤗Transformers	3	1938	June 5, 2022

Vanilla Transformer

Related topics