2022 Feb 11;23:121. doi: 10.1186/s12864-022-08358-2

Fig. 3.

Histogram of the proportion of spike protein nucleotide sites that are ambiguous (i.e. contain at least one IUPAC ambiguity code). The distribution is calculated over all SARS-CoV-2 sequences in GISAID that (i) have been assigned to a Pango lineage, and (ii) have N at < 5% of sites across the whole genome, excluding UTRs.
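
The quantity plotted in the histogram and the two inclusion criteria can be illustrated with a minimal Python sketch. It is not the authors' pipeline. It assumes the proportion is computed per sequence over the spike nucleotide region, that N counts as an ambiguity code, and that the genome string passed to the filter already has UTRs removed. The function names, the `has_lineage` flag, and the `max_n` parameter are hypothetical.

```python
# IUPAC nucleotide ambiguity codes. Assumption: N is counted as ambiguous,
# while A/C/G/T/U and gap characters are treated as unambiguous.
AMBIGUITY_CODES = frozenset("RYSWKMBDHVN")


def ambiguous_fraction(spike_seq: str) -> float:
    """Proportion of sites in a spike nucleotide sequence that hold an ambiguity code."""
    if not spike_seq:
        raise ValueError("sequence is empty")
    seq = spike_seq.upper()
    return sum(base in AMBIGUITY_CODES for base in seq) / len(seq)


def n_fraction(genome: str) -> float:
    """Proportion of sites that are N. Assumption: UTRs were trimmed beforehand."""
    if not genome:
        raise ValueError("sequence is empty")
    return genome.upper().count("N") / len(genome)


def passes_filter(genome: str, has_lineage: bool, max_n: float = 0.05) -> bool:
    """Inclusion criteria from the caption: (i) lineage assigned, (ii) N at < 5% of sites."""
    return has_lineage and n_fraction(genome) < max_n
```

Applying `ambiguous_fraction` to the spike region of every sequence that satisfies `passes_filter` would give the values binned in a histogram of this kind.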