The Encoder is on the left and the Decoder is on the right.
Both Encoder and Decoder are composed of modules that can be stacked on top of each other multiple times, which is described by Nx in the figure.
We see that the modules consist mainly of Multi-Head Attention and Feed Forward layers. The inputs and outputs (target sentences) are first embedded into an n-dimensional space since we cannot use strings directly.
Read more: https://lnkd.in/e2zj2Pg

Comments