Skip to main content
. 2023 Aug 9;10(8):948. doi: 10.3390/bioengineering10080948

Figure 4.

Figure 4

Self-attention block. X indicates the input images, and y indicates the output images of the self-attention layer. Input size of the image is H × W × C. H, W, and C represent the height, the width, and the output channel of the image, respectively. N = H × W. The different shade colors in N × N indicates the obtained attention coefficient (0–1), and dark colors indicates higher attention coefficient.