Skip to main content
. 2020 Oct 12;48(19):11030–11039. doi: 10.1093/nar/gkaa863

Table 2.

Population identification accuracy for codon pairing

Population Total individuals Number of clusters Percent accuracy
East Asia 504 1 100
Africa 661 9 98.7897
South Asia 489 56 88.7526
Europe 503 91 82.1074
America 347 100 71.4697

Using an alignment-free phylogenomic algorithm that analyzes only codon pairing usages across a genome, individuals in the 1000 Genomes Projects were clustered in a phylogeny (i.e. similar to a pedigree). Cluster accuracy was determined based on the number of clusters (i.e. clades) of individuals belonging the same superpopulation, where a new cluster was formed when an individual from a different population was added to the cluster. The table shows the superpopulation, the number of individuals in that population, the number of clusters identified using the phylogenomic algorithm, and the percent accuracy of the classification.