Skip to main content
. 2023 Feb 8;15:18. doi: 10.1186/s13321-023-00686-z

Fig. 1.

Fig. 1

Architecture of the used transformer model. Encoder and decoder layers are constructed following the original publication of the transformer model by Vaswani et al. [34]. To help conserve similarities in latent space, a special loss function denoted as ”similarity loss” is added to the reconstruction loss