2024 Mar 27;25:131. doi: 10.1186/s12859-024-05648-2

Table 1.

Number of collected SARS-CoV-2 genomes in (a) the main dataset (n = 1,131,185) and (b) the validation dataset (n = 67,399), broken down by clade and by continent

(a) The main dataset (n = 1,131,185)

| Clade | SARS-CoV-2 genomes | Continent | SARS-CoV-2 genomes |
|---|---|---|---|
| Clade_G | 163,511 (14.45%) | Africa | 17,986 (1.59%) |
| Clade_GH | 162,666 (14.38%) | Asia | 87,711 (7.75%) |
| Clade_GK | 154,275 (13.6%) | Europe | 576,936 (51.00%) |
| Clade_GR | 162,619 (14.37%) | North America | 389,136 (34.4%) |
| Clade_GRA | 159,190 (14.07%) | Oceania | 10,761 (0.951%) |
| Clade_GRY | 170,070 (15%) | South America | 43,548 (3.84%) |
| Clade_GV | 158,854 (14%) | Unknown | 5,107 (0.45%) |

(b) The validation dataset (n = 67,399)

| Clade | SARS-CoV-2 genomes | Continent | SARS-CoV-2 genomes |
|---|---|---|---|
| Clade_G | 3,161 (4.68%) | Africa | 2,225 (3.3%) |
| Clade_GH | 6,169 (9.15%) | Asia | 12,145 (18%) |
| Clade_GK | 22,436 (33.28%) | Europe | 28,940 (42.93%) |
| Clade_GR | 10,536 (15.63%) | North America | 13,784 (20.35%) |
| Clade_GRA | 17,844 (26.47%) | Oceania | 1,781 (2.64%) |
| Clade_GRY | 6,591 (9.77%) | South America | 6,761 (10.03%) |
| Clade_GV | 662 (0.98%) | Unknown | 1,763 (2.61%) |
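
The percentages in Table 1 are each count divided by the dataset total. As a minimal sketch of that arithmetic (the dictionary and function names here are illustrative assumptions, not taken from the paper), the clade counts can be checked against the stated totals and converted to percentage shares:

```python
# Clade counts copied from Table 1; names below are illustrative only.
MAIN_CLADES = {
    "Clade_G": 163_511, "Clade_GH": 162_666, "Clade_GK": 154_275,
    "Clade_GR": 162_619, "Clade_GRA": 159_190, "Clade_GRY": 170_070,
    "Clade_GV": 158_854,
}
VALIDATION_CLADES = {
    "Clade_G": 3_161, "Clade_GH": 6_169, "Clade_GK": 22_436,
    "Clade_GR": 10_536, "Clade_GRA": 17_844, "Clade_GRY": 6_591,
    "Clade_GV": 662,
}


def shares(counts):
    """Return each count as a percentage of the column total (2 decimals)."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 2) for name, n in counts.items()}


# The clade counts should add up to the dataset sizes given in the caption.
print(sum(MAIN_CLADES.values()), sum(VALIDATION_CLADES.values()))
print(shares(MAIN_CLADES))
print(shares(VALIDATION_CLADES))
```

The main dataset is close to balanced across clades (each clade is roughly 14 to 15% of the total), whereas the validation dataset is not. Recomputed shares may differ from the published values in the last digit, depending on whether a value was rounded or truncated.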