Chapter 1 questions

dmitrijsk · August 17, 2023, 11:59am

In the encoder-decoder architecture, the decoder looks only backwards, i.e., at the preceding tokens, as does the decoder-only arch. The encoder-decoder arch seems to be more powerful just because there’s a whole additional component (the encoder). If this is true then why not using encoder-decoder arch for everything that is currently done by the decoder-only arch (e.g., text generation)?

Topic		Replies	Views
Chapter 7 questions Course	119	10411	July 10, 2025
Bert2bert translator? 🤗Transformers	6	44	August 28, 2025
Chapter 3 questions Course	149	10533	August 29, 2025
Encoder-Decoder model only generates bos_token's [<s><s><s>] Models	17	3176	December 6, 2022
EncoderDecoderModel for token classification 🤗Transformers	0	194	October 29, 2022

Chapter 1 questions

Related topics