hi all,
does anyone know why transformer models are called transformer models?
Is it related to some kind of meme like the inception module?
Because they were a radical new architecture, compared to RNNs and CNNs, i.e. they transformed the architecture landscape.
1 Like
really?