Figure 4.
The architecture of the encoder in a transformer neural network. A transformer is a deep learning model that adopts the mechanism of self-attention, differently weighing the significance of each part of the input data. The encoder's job is to process the input data, such as a sentence in a source language, and convert it into a high-dimensional representation.
