Skip to main content
. 2021 Apr 8;22(5):bbab060. doi: 10.1093/bib/bbab060

Figure 1 .


Figure 1

Illustration of the data flow within the transformer architecture. The full genome is processed in sequential segments Inline graphic with length Inline graphic. First, the input nucleotide is transformed into a vector embedding Inline graphic, after which it is processed by Inline graphic consecutive residual blocks (Inline graphic). A set of fully connected layers transforms Inline graphic into a model output Inline graphic. For the calculation at each residual block, the upstream Inline graphic hidden states of the previous layer are applied (brown gradient). For example, the calculation of Inline graphic is based on the hidden states Inline graphic]. Hidden states from the previous segment (Inline graphic) are made accessible for the calculation of the hidden states in segment Inline graphic.