Skip to main content
. Author manuscript; available in PMC: 2017 Mar 21.
Published in final edited form as: Nature. 2016 Sep 21;538(7624):201–206. doi: 10.1038/nature18964

Extended Data Figure 2. Worldwide variation in human short tandem repeats.

Extended Data Figure 2

A: Mean STR length is reported as the average of the length difference (in base pairs) from the GRCh37 reference for each genotype. Bubble area scales with the number of calls compared at each point. B: and C: show the first two principal components after performing principal component analysis on tetranucleotide and homopolymer genotypes, respectively. Colors represent the region of origin of each sample. D: Pairwise FST values between populations computed using only SNPs vs. using combined SNP+STR loci. E: Block jackknife standard errors for the SNP vs. SNP+STR FST analysis. The red dashed lines give the best-fit line, described by the formula in red. The black dashed line denotes the diagonal.