How is the encoding done for transformers? What encoder is used?

I’ve just found a thread that might be of interest to you: https://discuss.huggingface.co/t/transformer-architecture-and-theory/14558/2.

On the other hand, two additional resources come to mind: the paper "Attention Is All You Need" and the book "Natural Language Processing with Transformers", in which you can find good diagrams that explain the encoder. The book has a companion repository on GitHub; although not all chapters have been released yet, you can see the images and some code (https://github.com/nlp-with-transformers/notebooks/blob/main/03_transformer-anatomy.ipynb).
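
In case a concrete example helps while you read those, here is a minimal sketch of the encoding pipeline using the Hugging Face `transformers` library. The checkpoint `bert-base-uncased` is just an illustrative choice; any encoder model would work the same way:

```python
# Minimal sketch: how text is encoded by a transformer encoder.
# bert-base-uncased is only an example checkpoint, not the only option.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

text = "Transformers encode text into contextual vectors."

# Step 1: the tokenizer converts text into input IDs plus an attention mask.
inputs = tokenizer(text, return_tensors="pt")
print(inputs["input_ids"])

# Step 2: the encoder maps the IDs to embeddings, adds positional
# information, and runs them through stacked self-attention +
# feed-forward layers.
with torch.no_grad():
    outputs = model(**inputs)

# One contextual vector per token: shape (batch, seq_len, hidden_size).
print(outputs.last_hidden_state.shape)
```

In short, the tokenizer does the text-to-IDs encoding, and the encoder stack turns those IDs into one contextual vector per token, which is exactly the part the book's diagrams and the `03_transformer-anatomy` notebook walk through.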