Close up view of specific clusters in human proteome UMAP (shown in
Figure 3A), with several LCR sequences and their parent proteins annotated. For all LCRs shown, the subscript at the end of the sequence corresponds to the ending position of the LCR in the sequence of its parent protein. (
A) Close-up view of S-rich Leiden cluster (bottom of UMAP in
Figure 3A). For LCRs along bridges connecting to leiden clusters of other amino acids, the residues of that other amino acid are underlined. For example, the LCR from ACRC lies in the bridge between the S and D clusters, so the D residues are underlined to highlight their frequency. (
B) Close-up view of P-rich, G/P-rich, and G-rich Leiden clusters (right side of UMAP in
Figure 3A). (
C) Close-up view of K-rich, E-rich, and D-rich Leiden clusters (left side of UMAP in
Figure 3A).