Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2019 May 20;116(23):11275–11284. doi: 10.1073/pnas.1816707116

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

Published under the PNAS license.

PMC Copyright notice

Fig. 6. — Sequence properties of consensus sequences. (A) Z scores (the number of SDs that separate the consensus sequence from the mean value of sequences in the MSAs; SI Appendix, Eq. 3) for various sequence properties. Distributions for all protein families are shown in SI Appendix, Fig. S11. (B) Differences between residue frequencies in the consensus sequences and the MSAs averaged over all seven protein families. Residues are colored as follows: polar charged (red), polar uncharged (blue), and nonpolar (black). The vertical offset is used for clarity. (C) Distributions of sequence entropy values for all positions in the PGK MSA (purple), positions at which residues in extant sequences differ from the consensus sequence (consensus mismatches; red), and positions at which residues in extant sequences match the consensus sequence (consensus matches; blue) for PGK. Sequence entropy distributions for all protein families are shown in SI Appendix, Fig. S13. (D) Ratios of conditional probabilities of different structural environments (surface, intermediate, and buried; “X” in the y label) for consensus mismatches relative to overall probabilities of surface, intermediate, and buried residues at all positions. Conditional and overall probabilities for all protein families are shown in SI Appendix, Fig. S14. The legend is as in A.