Skip to main content
. 2024 Nov 8;14:27242. doi: 10.1038/s41598-024-76781-4

Fig. 1.

Fig. 1

Self-attention vs. sparse attention patterns (Dark blue squares represent tokens attending to themselves, light blue squares indicate attention computations between the corresponding dark blue square token and other tokens).