Figure 1.
Global Genomic Epidemiology of Novel Coronavirus Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2).
(A) Phylogenetic analysis of 3485 genomes of SARS-CoV-2 sequences globally from December 2019 to December 2020. The different genetic variants of circulating SARS-CoV-2 are grouped into five clades defined by specific signature mutations showing their global distribution on the time scale. The clades 19A and 19B dominated the early outbreak in Wuhan and represent a higher proportion in Asia. Clades 20A, 20B, and 20C dominate in Europe and North America. (B) Geographical distribution of genomes. Each circle is centered on an individual country. The color indicates the region, and the size (area) of the circle represents the number of genomes from that country. (C) A ‘diversity’ panel that shows the novel coronavirus genome, its genes, and sites of amino acid mutations. (D) Subsection of subfigure (C) highlighting the mutation pattern in the 25 400–29 800 bp range of the genome. Apart from the spike (S) region, the genomic regions of open reading frame (ORF)14, ORF9b, ORF8, and ORF3a appear to be highly variable between clinical isolates of SARS-CoV-2. Source: latest global SARS-CoV-2 updated daily at https://nextstrain.org/sars-cov-2. Abbreviations: E, envelope; M, membrane.