Has the vanilla Transformer been implemented in the transformers library?

Sumsky21 · May 6, 2022, 9:32am

Hi all,

I’m planning to do a translation task using the vanilla Transformer model from the paper “Attention Is All You Need”, but when I searched the model list in the Transformers docs, there seems to be no original Transformer. Does anyone know where I can find a model for the translation task, or do I have to implement it manually?

I know that the PyTorch library has a Transformer class, but that class has no embedding module and isn’t specialized for a particular task (like BertForxxx etc.).
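To make the gap concrete, here is a rough sketch of what wrapping `torch.nn.Transformer` for translation might look like: token embeddings, the sinusoidal positional encoding from the paper, and an output projection are added around the core module. The class name `Seq2SeqTransformer`, the helper `embed`, and all the sizes are made up for illustration and are not part of PyTorch. It assumes a PyTorch version that supports `batch_first=True`.

```python
import math

import torch
from torch import nn


class Seq2SeqTransformer(nn.Module):
    """Hypothetical wrapper: embeddings + nn.Transformer + output projection."""

    def __init__(self, src_vocab, tgt_vocab, d_model=64, nhead=4,
                 num_layers=2, dim_ff=128, max_len=512):
        super().__init__()
        self.d_model = d_model
        self.src_embed = nn.Embedding(src_vocab, d_model)
        self.tgt_embed = nn.Embedding(tgt_vocab, d_model)

        # Fixed sinusoidal positional encoding, as described in the paper.
        position = torch.arange(max_len).unsqueeze(1)
        div_term = torch.exp(
            torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model)
        )
        pe = torch.zeros(max_len, d_model)
        pe[:, 0::2] = torch.sin(position * div_term)
        pe[:, 1::2] = torch.cos(position * div_term)
        self.register_buffer("pe", pe)

        self.transformer = nn.Transformer(
            d_model=d_model, nhead=nhead,
            num_encoder_layers=num_layers, num_decoder_layers=num_layers,
            dim_feedforward=dim_ff, batch_first=True,
        )
        # Projects decoder states to target-vocabulary logits.
        self.generator = nn.Linear(d_model, tgt_vocab)

    def embed(self, tokens, table):
        # Scale embeddings by sqrt(d_model), then add positional encoding.
        x = table(tokens) * math.sqrt(self.d_model)
        return x + self.pe[: tokens.size(1)]

    def forward(self, src, tgt):
        # Causal mask so each target position only attends to earlier ones.
        tgt_mask = self.transformer.generate_square_subsequent_mask(
            tgt.size(1)
        ).to(tgt.device)
        out = self.transformer(
            self.embed(src, self.src_embed),
            self.embed(tgt, self.tgt_embed),
            tgt_mask=tgt_mask,
        )
        return self.generator(out)


model = Seq2SeqTransformer(src_vocab=100, tgt_vocab=120)
src = torch.randint(0, 100, (2, 7))   # batch of 2 source sequences, length 7
tgt = torch.randint(0, 120, (2, 5))   # batch of 2 target prefixes, length 5
logits = model(src, tgt)
print(logits.shape)  # torch.Size([2, 5, 120])
```

This leaves out padding masks, tokenization, and the training loop, so it only illustrates the missing pieces rather than a full translation setup.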

BramVanroy · May 6, 2022, 9:35am

You may prefer to use libraries dedicated to translation instead, for instance OpenNMT, fairseq, or MarianMT. The last of these has also been implemented in transformers.

Vasily · May 31, 2022, 1:25pm

@BramVanroy I wonder whether it is possible to use a vanilla Transformer encoder without any pre-training?

BramVanroy · June 5, 2022, 11:09am

What do you mean? If you do not train a model, its weights are randomly initialized. That means that without training, the model will give you random output. You can make use of models that someone else pretrained, though: Models - Hugging Face
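As a small illustration of the "randomly initialized" case, the sketch below builds a Marian model in transformers from a config alone, without calling `from_pretrained`, so no pretrained weights are loaded. The tiny sizes and token ids are arbitrary values chosen to keep the example light; they are assumptions for illustration, not recommended settings.

```python
import torch
from transformers import MarianConfig, MarianMTModel

# Tiny, arbitrary configuration (illustrative values only).
config = MarianConfig(
    vocab_size=1000,
    d_model=64,
    encoder_layers=2,
    decoder_layers=2,
    encoder_attention_heads=4,
    decoder_attention_heads=4,
    encoder_ffn_dim=128,
    decoder_ffn_dim=128,
    max_position_embeddings=128,
    pad_token_id=999,            # must be a valid index below vocab_size
    eos_token_id=0,
    decoder_start_token_id=999,
)

# Constructing from a config gives randomly initialized weights.
model = MarianMTModel(config)
model.eval()

input_ids = torch.randint(0, 999, (2, 7))          # fake source token ids
decoder_input_ids = torch.randint(0, 999, (2, 5))  # fake target prefix ids

with torch.no_grad():
    logits = model(
        input_ids=input_ids, decoder_input_ids=decoder_input_ids
    ).logits

print(logits.shape)  # (batch, target length, vocab size)
```

Until such a model is trained on parallel data, its predictions are effectively noise, which is the point made above.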
