Fig. 4. Demographics of SARS-CoV-2.
a The top panel shows a phylogenetic tree of 3852 SARS-CoV-2 genomes sampled globally between December 2019 and March 2021. The bottom panel shows the geographic distribution of the major SARS-CoV-2 clades. Clades were defined using Nextstrain nomenclature based on global frequency, variation from parent clade, and year of emergence. The relative global frequency of b all major SARS-CoV-2 clades, c the amino acid variant at position 614 of the spike protein (D: aspartic acid and G: glycine), d the amino acid variant at position 452 of the spike protein (L: leucine and R: arginine), and e the amino acid variant at position 501 of the spike protein (N: asparagine, Y: tyrosine, and T: threonine). For a, b, clades were named according to Nextstrain nomenclature, which distinguishes clades based on global frequency, year of emergence, and a unique letter. For a–e, data visualization was performed by nextstrain.org with data provided by GISAID.