Reproduce attention is all you need

Onlydrinkwater · June 25, 2022, 12:41am

Is there any pretrained model implemented by HF which have the exactly the same structure as the vanilla transformer? so that I can just set the config file for that model and reproduce the result in paper ‘attention is all you need’.

Any help would be appreciated!

Topic		Replies	Views
Vanilla Transformer Beginners	1	1177	June 6, 2023
Adding cross-attention to custom models 🤗Transformers	2	3534	October 21, 2022
How to train a translation model from scratch to reproduce <attention is all you need>? Beginners	0	400	November 29, 2022
How to avoid downloading models Beginners	0	797	June 25, 2023
How to make pure transformer model Beginners	0	136	May 22, 2024

Reproduce attention is all you need

Related topics