Chapter 1 questions

In the encoder-decoder architecture, the decoder looks only backwards, i.e., at the preceding tokens, just as the decoder-only architecture does. The encoder-decoder architecture seems more powerful simply because it has a whole additional component (the encoder). If this is true, then why not use the encoder-decoder architecture for everything that is currently done with the decoder-only architecture (e.g., text generation)?
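
To make "looks only backwards" concrete, here is a minimal sketch of the two attention patterns involved, in plain Python. The function names (causal_mask, full_mask, show) are hypothetical and written for these notes, not taken from any library. The sketch assumes the usual convention that a decoder position may attend to itself and to earlier positions, while an encoder position may attend to every position in the input.

```python
def causal_mask(n):
    """Decoder-style mask: position i may attend to positions 0..i only."""
    return [[j <= i for j in range(n)] for i in range(n)]


def full_mask(n):
    """Encoder-style mask: every position may attend to every position."""
    return [[True] * n for _ in range(n)]


def show(mask):
    """Render a mask as text: 'x' = attention allowed, '.' = blocked."""
    return "\n".join(
        " ".join("x" if allowed else "." for allowed in row) for row in mask
    )


if __name__ == "__main__":
    print("decoder (causal) self-attention:")
    print(show(causal_mask(4)))
    print("encoder (bidirectional) self-attention:")
    print(show(full_mask(4)))
```

Each row is a query position and each column is a position it may read from. The causal mask is lower-triangular in both the decoder-only and the encoder-decoder case; what the encoder adds is a second, unmasked view of the input sequence that the decoder can read from.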