Encoder-Decoder vs Decoder-Only Architecture Models

Transformers originally started as encoder-decoder models built for machine translation tasks. Since then, decoder-only transformer models have emerged as strong contenders for 1) translation, 2) better generalization to downstream tasks, and 3) a host of applications ranging from classification to translation to generation.

  1. When should we consider an encoder-decoder style architecture vs a decoder-only architecture?
  2. In what cases can an encoder-decoder architecture outperform a decoder-only architecture?
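For context on the questions above, the core structural difference between the two families comes down to attention masking: a decoder-only model uses a single causal mask everywhere, while an encoder-decoder model combines a bidirectional encoder, a causal decoder, and cross-attention from decoder to encoder. A minimal NumPy sketch of those masks (the function names here are just for illustration, not from any particular library):

```python
import numpy as np

def causal_mask(n):
    # Decoder-only: each position attends only to itself and earlier positions.
    return np.tril(np.ones((n, n), dtype=bool))

def encoder_decoder_masks(src_len, tgt_len):
    # Encoder self-attention is fully bidirectional: every source token
    # can attend to every other source token.
    enc_self = np.ones((src_len, src_len), dtype=bool)
    # Decoder self-attention remains causal, as in a decoder-only model.
    dec_self = causal_mask(tgt_len)
    # Cross-attention: every target position can see the entire encoded source.
    cross = np.ones((tgt_len, src_len), dtype=bool)
    return enc_self, dec_self, cross

enc_self, dec_self, cross = encoder_decoder_masks(src_len=4, tgt_len=3)
print(dec_self.astype(int))
```

The practical upshot is that the encoder gets a full bidirectional view of the input, which is one intuition for why encoder-decoder models can do well on tasks with a clearly separated input (translation, summarization), while decoder-only models process everything through one causal stream.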
