Figure 3.

Variability within 54 2019‐nCoV full genomic sequences. A, location of major structural protein‐encoding genes (red boxes; S = Spike protein, E = Envelope protein, M = Membrane protein, N = Nucleocapsid protein) and accessory protein ORFs (blue boxes) on the meta‐genomic sequence derived from the MSA of all genomes. B, Shannon entropy values across genomic locations. The two coordinates with the highest entropy (excluding the 5′ and 3′ highly variable UnTranslated regions) are indicated. C, Zoom‐in of the MSA describing the two most variable locations in the core genome, in the ORF1ab (left) and in ORF8 (right). MSA, multiple sequence alignment